Editorial Note


It gives me immense satisfaction that we could assemble a group of distinguished 
scholars at this Institute to conduct a seminar on the history of mathematics in India, 
leading to the publication of this volume. This whole endeavour owes very much to 
the initiative and commitment of David Mumford. I am very glad that this provided 
also an occasion for having amidst us the renowned indologist Frits Staal. 

It is fitting that this volume is dedicated to the memory of the two outstanding 
scholars David Pingree and K.V. Sarma who have made pioneering contributions 
to this field. 

It is my pleasant duty to thank all those who have contributed to this volume, 
as well as to the one invisible contributor Jayant Shah (Northeastern University, 
Boston) for his critical reading of all the material in this volume. 


C.S. Seshadri 
Director 
Chennai Mathematical Institute 
Preface


It was a great privilege to participate in the Seminar Topics in the History of Indian 
and Western Mathematics which took place at the Chennai Mathematical Institute in
January and February 2008. I am a newcomer in this area and cannot read Sanskrit, 
but I had the opportunity to listen and learn from Sanskrit scholars, mostly Indian but 
one Western (Staal) and absorb their various perspectives. It has been very exciting 
to me over the last five years to learn something of the distinctive Indian approach 
to mathematics, from its beginnings in Vedic times to its wonderful achievements 
in algebra and the Calculus, just prior to the waves of Western invasions starting in 
the 16th century.

Today, there is a major resurgence of scholarship in Indian mathematics and 
astronomy both in India and the West, led, on the one hand, by a widespread wave 
of renewed interest in India in Sanskrit studies and on the other hand, by the school 
created by my late colleague David Pingree. Until recently, nearly all articles and 
books on the History of Indian Mathematics and Astronomy were nearly impossible 
to find in the West. But now, new translations and critical editions with commentary 
of many of the extant manuscripts are being published and widely distributed in the
West as well as in India. 

This volume contains analyses of many of the most important topics in Indian 
Mathematics and Astronomy, taken from talks at this seminar. Let me sketch some 
of the ideas from each chapter, highlighting topics which seemed to me especially 
significant. I will follow a roughly chronological order. 

Dani's chapter deals with the oldest extant Indian mathematical works, the
Sulbasutras, or “Rules of the Cord”, which date from as early as c.800 BCE. These
are manuals on the geometry needed for erecting the fire altars central to Vedic ritual.
Dani describes in detail their techniques of geometric algebra, their construction
for a square whose area is the sum of those of two given squares or whose area is
the same as that of a given rectangle. These anticipate many of the constructions
which appear later in Book II of Euclid's Elements, with knotted cords replacing
the Greeks' use of straight edge and compass. In particular, “Pythagoras’s” theorem
is described here well before Pythagoras. (But note that it also occurs as early as
c.1800 BCE on Babylonian tablets.) Dani then goes on to describe their approximate
constructions of circles whose area is that of a given square and vice versa. His last
section deals with the rational approximation for the square root of two given
in three of the four Sulbasutras. Like the Babylonian approximation, it is accurate
to roughly one part in a million. Dani describes a striking geometric method, first
proposed by Datta, by which this approximation might have been found. But he also
proposes that, since they were using very long ropes to actually lay out fire altars, it
is not impossible that they could have carried out a continued fraction-like algorithm
using two ropes and repeatedly taking multiples of the shorter away from the longer.
The idea that such algorithms have a very long history connects to Dutta’s article.
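
The two-rope procedure, repeatedly taking multiples of the shorter length away from the longer, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names are hypothetical, the diagonal of a unit square is modelled by a rational stand-in for the square root of two, and the closed form 1 + 1/3 + 1/(3·4) - 1/(3·4·34) is the commonly cited reading of the Sulbasutra rule rather than something quoted in this preface.

```python
from fractions import Fraction
from math import isqrt

def rope_quotients(longer, shorter, steps):
    """Repeatedly take whole multiples of the shorter length away from the longer."""
    quotients = []
    for _ in range(steps):
        if shorter == 0:
            break
        q = longer // shorter          # how many times the shorter rope fits
        quotients.append(q)
        longer, shorter = shorter, longer - q * shorter
    return quotients

def convergents(quotients):
    """Rational approximations obtained by unwinding the recorded quotients."""
    h_prev, h = 1, quotients[0]
    k_prev, k = 0, 1
    result = [Fraction(h, k)]
    for q in quotients[1:]:
        h_prev, h = h, q * h + h_prev
        k_prev, k = k, q * k + k_prev
        result.append(Fraction(h, k))
    return result

# Assumption: a rational stand-in for sqrt(2), exact to about 20 decimal places.
diagonal = Fraction(isqrt(2 * 10**40), 10**20)
side = Fraction(1)

quotients = rope_quotients(diagonal, side, 8)
approximations = convergents(quotients)

# Commonly cited form of the Sulbasutra approximation (an assumption here).
sulba_value = 1 + Fraction(1, 3) + Fraction(1, 3 * 4) - Fraction(1, 3 * 4 * 34)

print(quotients)            # [1, 2, 2, 2, 2, 2, 2, 2]
print(approximations[-1])   # 577/408
print(sulba_value)          # 577/408
```

Under these assumptions the eighth approximation produced by the rope procedure coincides with the Sulbasutra value, which is at least consistent with the suggestion that such an algorithm could have been used.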

Staal, the other Westerner besides myself in this seminar, takes up the origin
of the number zero and proposes a link of the mathematical zero with the linguistic
zero markers which were invented by the great grammarian Panini. The foundations
of modern computer science as well as linguistics go back to Panini (c.400 BCE).
Panini invented formal grammar with abstract variables for various parts of an 
utterance and recursive rewrite rules. Sanskrit grammar was well-known to all the 
mathematician-astronomers and is a plausible source for many ideas which were 
later developed in more mathematical ways. Staal focuses on the Sanskrit word 
lopa, "something that does not appear", as a precursor to the idea of zero. As he 
has written in other works, Sanskrit grammar appears to grow out of the precisely 
formalized Vedic rituals. Astonishingly, the Vedas dealt with possible enactments 
of a ritual in which one participant fails to utter some required sentence. The verbal 
root lup- of lopa is used to describe this failure: a zero in the ritual. Thus the original 
zero could be the priest’s lapse. 

Raja Sridharan, R. Sridharan and M. D. Srinivas’s chapter concerns another 
area of Indian mathematics: the combinatorics which was inspired first by the 
study of Sanskrit prosody and later by the study of musical patterns, both tonal 
and rhythmic. Panini was followed by Pingala (c.300 BCE) who studied Sanskrit 
prosody. Sanskrit was traditionally written in verse and memorized. Each line had 
a characteristic pattern of short and long syllables. Pingala devised a way to order 
all patterns of short and long syllables in a line with n syllables, and, using this, 
(i) to compute the number describing each pattern, (ii) to reconstruct the pattern 
from its number and finally (iii) to compute the number of patterns with a fixed 
number of short, respectively, long syllables. The first and second use the binary 
number system and the third involves calculating the binomial coefficients. This 
beautiful foundation led to much further combinatorial work which the authors 
survey. For instance the Fibonacci numbers arose when they asked how many
sequences have a given total length if long syllables are given length 2 and short
length 1. Virahanka discovered the Fibonacci numbers and their recursion relation
in the 7th century (well before Fibonacci!). Recursion seems to be a general theme
which runs through much of Indian mathematics. Most of their chapter concerns the
generalization to musical phrases and to musical rhythms, where the combinatorics
gets more complex and more interesting. This story is beautifully described by the
authors.
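
The three tasks attributed to Pingala, together with the count that leads to the Fibonacci numbers, can be sketched as follows. The encoding is an assumption made for illustration: "S" stands for a short syllable and "L" for a long one, a long syllable counts as binary digit 0 and a short one as 1, and the leftmost syllable is the least significant digit. The function names are hypothetical, and the traditional row numbering may differ from the zero-based index used here.

```python
from math import comb

def pattern_to_index(pattern):
    """Task (i): the number describing a pattern, read as a binary numeral."""
    return sum(1 << i for i, s in enumerate(pattern) if s == "S")

def index_to_pattern(index, n):
    """Task (ii): reconstruct the n-syllable pattern from its number."""
    return "".join("S" if (index >> i) & 1 else "L" for i in range(n))

def count_with_shorts(n, k):
    """Task (iii): patterns of n syllables with exactly k short ones."""
    return comb(n, k)

def count_by_duration(total):
    """Patterns of total duration `total` when short = 1 and long = 2."""
    a, b = 1, 1                      # counts for durations 0 and 1
    for _ in range(total - 1):
        a, b = b, a + b              # last syllable is short (b) or long (a)
    return b

print([index_to_pattern(i, 3) for i in range(8)])
print([count_with_shorts(4, k) for k in range(5)])    # [1, 4, 6, 4, 1]
print([count_by_duration(t) for t in range(1, 8)])    # [1, 2, 3, 5, 8, 13, 21]
```

The last function uses the recursion mentioned in the text: a pattern of a given duration ends either in a short syllable or in a long one, so its count is the sum of the two preceding counts.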

My own chapter concerns the introduction of negative numbers in India, in
China, in Greece and in modern Europe. The full arithmetic of negative numbers
appears in Brahmagupta's Brahma-sphuta-siddhanta in the 7th century and
presumably arose much earlier, maybe even in the accounting practices described in
Kautilya’s Arthasastra. In contrast, the first place where this is correctly described
without hesitation¹ in modern Europe is in Wallis’s Treatise on Algebra in 1685.
This seems to me a stunning example of how far two mathematical traditions can
diverge, though converging in the end.

¹ Other Westerners wondered whether $(-1) \cdot (-1)$ might be taken to equal $-1$.

Dutta’s contribution concerns Indian arithmetic and algebra. More specifically, 
it concerns three algorithms. The first, called kuttaka (the pulverizer) constructs, 
for all positive integers $a, b$, positive integers $x, y$ such that $ax - by$ equals plus
or minus the greatest common divisor $c$ of $a$ and $b$. The first half of the algorithm
is the same as the Euclidean algorithm, successive subtraction of the less from the 
greater, and the second half works backwards to find x and y—just as we do today. 
This second step was not taken by the Greeks. Kuttaka appears first in rather cryptic 
form in the Aryabhatiya (499 CE) and soon after more explicitly in the works of 
Bhaskara I and Brahmagupta. They use it to construct all positive integral solutions
of linear equations $ax - by = d$, for positive integers $a, b, d$. This had long been
a concern because of the ancient Vedic method for making sense of the relative 
periods of the day, the lunar month, the year and the periods in planetary motion: 
that at the beginning of the present Kaliyuga, all the planets, the sun and moon were 
all lined up in one spectacular conjunction. 
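
The two halves of the kuttaka can be sketched in modern terms as follows: a forward pass of Euclidean division that records the quotients, then a backward pass that rebuilds the multipliers. This is a minimal sketch, not a reconstruction of the procedure as stated in the Sanskrit texts. The function names are hypothetical, and only the "plus" case $ax - by = \gcd(a, b)$ is returned.

```python
def kuttaka(a, b):
    """Return (g, x, y) with g = gcd(a, b), x, y > 0 and a*x - b*y == g."""
    # First half: Euclid's algorithm, recording the quotients.
    quotients = []
    r0, r1 = a, b
    while r1:
        quotients.append(r0 // r1)
        r0, r1 = r1, r0 % r1
    g = r0

    # Second half: work backwards so that a*u + b*v == g.
    u, v = 1, 0
    for q in reversed(quotients):
        u, v = v, u - q * v

    # Shift to the smallest solution with both unknowns positive.
    x = u % (b // g)
    y = (a * x - g) // b
    while x <= 0 or y <= 0:
        x += b // g
        y += a // g
    return g, x, y

def solve_linear(a, b, d):
    """Smallest positive (x, y) with a*x - b*y == d, or None if none exists."""
    g, x, _ = kuttaka(a, b)
    if d % g:
        return None
    step_x, step_y = b // g, a // g
    x0 = (x * (d // g)) % step_x
    y0 = (a * x0 - d) // b
    while x0 <= 0 or y0 <= 0:
        x0 += step_x
        y0 += step_y
    return x0, y0

print(kuttaka(17, 5))          # (1, 3, 10)
print(solve_linear(17, 5, 3))  # (4, 13)
```

Every further positive solution of $ax - by = d$ is obtained from the smallest one by adding the same multiple of $b/g$ to $x$ and of $a/g$ to $y$.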

The second and third algorithms are concerned with “Pell’s” equation $x^2 - Dy^2 = 1$,
which quite plausibly arose in the search for good rational approximations $x/y$
to $\sqrt{D}$. The second algorithm, due to Brahmagupta and called the Bhavana, is
equivalent, in modern terms, to the law for multiplication of the algebraic integers
$x + y\sqrt{D}$. As they put it, if $x^2 - Dy^2 = m$ and $u^2 - Dv^2 = n$, then $s = xu + Dyv$,
$t = xv + yu$ solves $s^2 - Dt^2 = mn$. They now played with solutions of the equations
$x^2 - Dy^2 = m$ for various $m$ and several centuries later, they did indeed find
an algorithm which always finds solutions with $m = 1$, the Cakravala. Thus
they had a complete theory of Pell's equation, modulo one point: a proof that
this worked. It is interesting that the standard proofs in modern texts are
non-constructive whereas the Indian mathematicians focused instead on seeking
constructive methods and never studied non-constructive arguments. Their idea of
mathematics was closer to that of applied mathematicians and computer scientists 
than that of pure mathematicians. Indeed the siddhantas (treatises) where these 
algorithms were written down were manuals for actually calculating astronomical 
events. 
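
The composition law can be checked directly. The sketch below assumes triples of the form (x, y, m) with $x^2 - Dy^2 = m$. The function names are hypothetical, and the example with D = 92 is a commonly cited worked case, used here only as an illustration.

```python
from fractions import Fraction

def bhavana(D, first, second):
    """Compose (x, y, m) and (u, v, n), where x^2 - D*y^2 = m and u^2 - D*v^2 = n."""
    x, y, m = first
    u, v, n = second
    return (x * u + D * y * v, x * v + y * u, m * n)

def norm(D, x, y):
    """The quantity x^2 - D*y^2."""
    return x * x - D * y * y

# D = 2: composing the solution (1, 1) for m = -1 with itself gives m = 1.
step1 = bhavana(2, (1, 1, -1), (1, 1, -1))
print(step1)   # (3, 2, 1)

# D = 92: start from 10^2 - 92 * 1^2 = 8 and compose it with itself.
s, t, m = bhavana(92, (10, 1, 8), (10, 1, 8))     # (192, 20, 64)
# Dividing s and t by 8 divides m by 64, giving a rational solution with m = 1.
half = (Fraction(s, 8), Fraction(t, 8), 1)        # (24, 5/2, 1)
x, y, _ = bhavana(92, half, half)
print(x, y)    # 1151 120
```

Composing a solution for m = 1 with any other solution leaves m unchanged, so one solution of Pell's equation generates infinitely many.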

Ramasubramanian and Srinivas take up the story of Indian work on Calculus. 
I personally feel this is a story which deserves to be much more widely known 
in the West. Their chapter outlines the millennium long history of these discover- 
ies, culminating in the complete analysis of the basic calculus of polynomial and 
trigonometric functions, their integrals and derivatives and power series for sine, 
cosine and arctangent—and, of course, applications of these to astronomy. The path 
they took to this is quite disjoint from the path that was taken, first by Archimedes 
and later by Newton and Leibniz. It is extremely unlikely that any of Archimedes’ 
work on, e.g. the Riemann sum for the integral of sine, made its way to India. 
Instead the Indian work seems to have taken off with their discovery of the second 
order finite difference equation for sine in the 5th century CE or earlier (the discrete
analog of the result that sine solves the harmonic equation $y'' + y = 0$). This appears
cryptically in the Aryabhatiya, and clearly soon after, as is well described in Section
5 of Ramasubramanian and Srinivas's chapter. There was also an early interest in
summing powers of natural numbers, which is described in Section 4. This early work
revolves around finite differences and the corresponding sums, much like Leibniz's
starting point for calculus. But the application to astronomy made it clear by the
10th century that one needed to calculate the “instantaneous velocity” as well as its
finite difference approximation (see Section 6). In the 12th century, Bhaskara II used
these ideas to rediscover Archimedes' derivation of the formula for the area and 
volume of a sphere. This derivation reduces the problem to computing the integral 
of sine. But, interestingly, as an applied mathematician, he completes the proof by 
numerically summing his sine table—even though he knows quite well that cosine 
differences are sines and could have done it this way. The numerical method was 
apparently more convincing! 
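
In modern notation the second-order difference equation for sine reads $\sin((n+1)h) - 2\sin(nh) + \sin((n-1)h) = -(2\sin(h/2))^2 \sin(nh)$. The sketch below builds a sine table from this recursion alone. It is an illustration under assumptions: the step of 3.75 degrees (24 entries per quadrant) and the function name are choices made here, and floating-point values are used for the seed and the constant, whereas historical tables worked with rounded values.

```python
import math

def sine_table(step_deg=3.75, entries=24):
    """Build sin(0), sin(h), ..., sin(entries*h) from the second-difference rule."""
    h = math.radians(step_deg)
    c = (2 * math.sin(h / 2)) ** 2        # constant in the difference equation
    table = [0.0, math.sin(h)]            # the two seed values
    while len(table) <= entries:
        prev, cur = table[-2], table[-1]
        # next = cur + (cur - prev) - c * cur
        table.append(2 * cur - prev - c * cur)
    return table

table = sine_table()
worst = max(abs(v - math.sin(math.radians(3.75 * i))) for i, v in enumerate(table))
print(len(table), worst)
```

Each new entry is the previous entry plus the previous difference, corrected by a fixed fraction of the previous entry, which is the discrete counterpart of $y'' = -y$.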

The crowning achievements of this work are due to a major genius who is nearly 
unknown in the West: Madhava, who lived in a village in Kerala in the 14th century.
Only a small fragment of his work survives, but, fortunately, his and his school's
work was written up in an unusual expository form in the local language, Malayalam,
by Jyesthadeva in the 16th century, still over a century before Newton and Leibniz
did their work. This book, the Yukti-bhasa, has only now been translated into English
by K. V. Sarma with commentary by the authors of this chapter and M. S. Sriram. The
first volume was released with some ceremony during our seminar. The work of the
Kerala school is described in Part 2 of the present chapter. Let me only mention that
in addition to deriving the power series for sine and cosine and the ‘Gregory’ series

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \cdots,$$

their numerical bent appears again in finding a series of ways to estimate the
remainder so that this becomes a practical tool for calculating $\pi$. This led them to
much more rapidly converging series such as:

$$\frac{\pi}{16} = \frac{1}{1^5 + 4\cdot 1} - \frac{1}{3^5 + 4\cdot 3} + \frac{1}{5^5 + 4\cdot 5} - \cdots$$
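
The difference in convergence speed is easy to see numerically. The sketch below takes the ‘Gregory’ series as $\pi/4 = 1 - 1/3 + 1/5 - \cdots$ and the faster series as $\pi/16 = 1/(1^5 + 4\cdot 1) - 1/(3^5 + 4\cdot 3) + 1/(5^5 + 4\cdot 5) - \cdots$. The function names are hypothetical, and the comparison only illustrates the point made in the text.

```python
import math

def gregory_partial(terms):
    """Partial sum of 4 * (1 - 1/3 + 1/5 - ...)."""
    return 4 * sum((-1) ** k / (2 * k + 1) for k in range(terms))

def fast_partial(terms):
    """Partial sum of 16 * (1/(1^5 + 4*1) - 1/(3^5 + 4*3) + ...)."""
    return 16 * sum(
        (-1) ** k / ((2 * k + 1) ** 5 + 4 * (2 * k + 1)) for k in range(terms)
    )

for terms in (1, 5, 10):
    slow_error = abs(gregory_partial(terms) - math.pi)
    fast_error = abs(fast_partial(terms) - math.pi)
    print(terms, slow_error, fast_error)
```

With ten terms the first series is still off by about a tenth, while the error of the second is already below one part in a hundred thousand.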


Divakaran's article proposes the compelling thesis that recursion is the central 
theme and technique that runs through all the Indian work in mathematics. He traces 
this to Panini’s grammatical rules and even earlier to Vedic ritual. He points out 
that the whole idea of decimal place-value notation can be seen as a way to describe 
integers recursively. Recursive generation of larger and larger numbers leads to a 
clear conception of the infinitely large and, as its inverse, to a conception of the
infinitely small and of the limiting process. The bulk of his article, however, focuses on the
extensive use of recursion in the Kerala work on Calculus. Like Newton, he sees
the introduction of power series as an algebraic analog of decimal expansions ($x^n$
being the analog of $10^n$) and he discusses at length the power series expansion

$$\frac{1}{1+x} = 1 - x + x^2 - x^3 + \cdots$$

found in the Yukti-bhasa. He then describes the huge step which one finds in the
Yukti-bhasa: recursive proofs based on induction: they derive the integral of $x^n$
using induction on $n$. Finally, he discusses the derivation of the power series for
sine and cosine in the Yukti-bhasa and notes that it is derived by first converting the
known difference equations for sine in terms of cosine and cosine in terms of sine 
into a summation form. This is the exact analog of converting differential equations 
into integral equations. Jyesthadeva, starting with a crude approximation, then 
recursively back-substitutes each approximation into the summation form to get a 
better one. Passing to the limit, he gets the power series. This back substitution is 
exactly what we do today to solve Fredholm integral equations. 
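
The back-substitution can be illustrated in its continuous form, which is an assumption of this sketch, since the text describes the Yukti-bhasa as working with discrete sums. Because sine satisfies $s(x) = x - \int_0^x \int_0^t s(u)\,du\,dt$, one can start from the crude approximation $s(x) = x$ and substitute repeatedly. Polynomials are represented as coefficient lists, and the function names are hypothetical. This style of iteration is nowadays often called successive approximation or Picard iteration.

```python
from fractions import Fraction

def integrate(poly):
    """Integrate the polynomial with coefficients [c0, c1, ...] from 0 to x."""
    return [Fraction(0)] + [c / (i + 1) for i, c in enumerate(poly)]

def refine_sine(approx):
    """One back-substitution step: s(x) -> x - (double integral of s)."""
    double = integrate(integrate(approx))
    result = [Fraction(0)] * max(len(double), 2)
    result[1] = Fraction(1)               # the leading term x
    for i, c in enumerate(double):
        result[i] -= c
    return result

s = [Fraction(0), Fraction(1)]            # crude starting approximation s(x) = x
for _ in range(3):
    s = refine_sine(s)

print([str(c) for c in s])
# ['0', '1', '0', '-1/6', '0', '1/120', '0', '-1/5040']
```

Each substitution adds one more term of the power series, so passing to the limit yields the full expansion of sine.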

Finally, M. S. Sriram’s chapter addresses the planetary models in Indian
Astronomy. As mentioned above, mathematics was usually studied together with
astronomy and the two subjects advanced hand in hand. His article concentrates
on their last model which is due to Nilakantha c.1500 CE, who worked at roughly
the same time as Copernicus in the West. His model is explained at length in the
Ganita-yukti-bhasa and is described with modern formulas in Sriram’s chapter. The
most interesting issue here is the historical movement from geocentric models
to heliocentric models. In fact, Nilakantha’s model is ‘essentially’ heliocentric. In
his model, one starts with the mean sun moving in the ecliptic circle. Then for each
planet, one takes a plane centered at the mean sun but inclined to the ecliptic by a
different amount for each planet and intersecting it at the appropriate nodes. And in each of
these planes one places the corresponding planet on the Ptolemaic approximation to 
the ellipse: an eccentric circle. Once one overcomes the dense tangle of compound 
Sanskrit words, a very modern model shines through. 

These articles together cover a substantial portion of the exciting History of 
Indian Mathematics and Astronomy. Only a fraction of this has become generally 
known to mathematicians in the West. Too many people still think that mathematics 
was born in Greece and more or less slumbered until the Renaissance. Therefore, I 
hope that this book may serve as a way of bringing to the international mathematical 
community a deeper knowledge of the riches in Indian Mathematics. To scholars, 
however, there is another message: there is much work still to be done to study, edit 
and translate the many ancient manuscripts still surviving in libraries all over India. 
One hopes for a deeper and broader picture of the more than two millennium long 
history when every one of these has been looked at and analyzed. 




David Mumford 
David Pingree, 1933-2005
Kim Plofker 


David Edwin Pingree (2 January 1933 – 11 November 2005) employed the fifty years
of his scholarly career investigating the development of mathematics, astronomy 
and the related exact sciences from ancient Mesopotamia to early modern Europe 
and India. He published editions, translations and studies of source texts in Akka- 
dian cuneiform, Greek, Latin, Sanskrit, Arabic and Persian, on subjects ranging 
from infinite series and interpolation techniques to astral magic and iconography 
in astrological texts. He was professionally affiliated with Harvard University as 
an undergraduate (B.A. in Classics and Sanskrit, 1954), graduate student (Ph.D. 
in Sanskrit and Indian Studies, 1960), and Junior Fellow (to 1963); the Univer- 
sity of Chicago as a faculty member in the Oriental Institute and Departments of 
History, South Asian Languages, and Near Eastern Languages (1963-1971); and 
Brown University as a professor in the Departments of Classics and the History 
of Mathematics (1971-2005). Over the course of his immensely productive career 
he received many honors, including a Fulbright Scholarship, a Guggenheim Fel- 
lowship, and membership in several learned societies, among them the American 
Academy of Arts and Sciences, the American Philosophical Society, and the Insti- 
tute for Advanced Study at Princeton University. Pingree was awarded the title of 
“Abhinavavarahamihira” by the government of Uttar Pradesh in 1979, and in 1981 
was one of the first recipients of the MacArthur Fellowship (popularly nicknamed 
the “Genius Grant”), together with co-honorees including the philosopher Richard
Rorty, the paleontologist Stephen Jay Gould, and the computer scientist Stephen 
Wolfram. His total scholarly output comprised several dozen monographs and sev- 
eral hundred research articles, reviews, encyclopedia entries, and other works. 

In his research on the history of science, David Pingree was first and foremost 
what might be termed a “transmissionist”: he was primarily interested in what he 
called the “kinematics” of scientific development, the ways that scientific ideas were 
passed from one culture to another, and how they were transformed in the process. 
The first awakening of his interest in the history of science as a field of research was 
due, as he described it, to just such a question of transmission from the medieval 
Indian exact sciences to their Greek counterpart. As a Fulbright scholar in 1955, 
reading a Byzantine Greek manuscript on astrology in the Vatican Library, he noticed 
marginal notes by a commentator that included technical terms transliterated from 
Sanskrit. This discovery eventually inspired his important studies of the influence 
of Indian astrology on Arabic and Persian texts and consequently on the Greek
astral science tradition of Byzantium. Pingree also explored the mirror image of 
this transmission in an earlier era, tracing the assimilation of Hellenistic horoscopic 
astrology into the Sanskrit discipline of yavana-jataka via the pre-Gupta Indo-Greek 
kingdoms in western India. This work, which became his doctoral dissertation and 
his first monograph, led directly to his prolonged collaboration with the great 
Assyriologist historian of science Otto Neugebauer, and to his study of Akkadian, 
Arabic and Persian in order to track similar exchanges of scientific knowledge 
through the entire four-thousand-year history of the great Eurasian development of 
the exact sciences. (Even erudition as deep as Pingree’s was forced to submit to 
some limitations in this quest, however: to the end of his life he remained somewhat 
regretful, and slightly apologetic, that he had never found time to undertake learning 
Chinese.) 

Pingree’s hypotheses about cross-cultural transmission of scientific ideas were 
frequently groundbreaking and sometimes controversial. A few of them were ten- 
tative speculations that never attained wide acceptance, such as his suggestion that 
the structure of Bhaskara II's table of easy Sines (laghu-jya) might reflect influence
from Islamic trigonometry. Some were plausible inferences unproven by conclusive 
textual evidence that were accepted by most mainstream historians but strongly 
resisted by some others, such as his attribution of parts of the Sanskrit
Jyotisa-vedanga/Vedanga-jyotisa to Babylonian mathematical astronomy techniques in the
Achaemenid period or his derivation of Aryabhata's planetary mean motions around 
500 CE from a hypothesized Greek source. Some were preliminary conclusions 
based on little-known textual sources that remain to be explored more fully, such 
as his deductions about the relationship of Mesopotamian and ancient Indian astral 
omen literature or about the recognition of Islamic optics in seventeenth-century 
Sanskrit astronomy. But most of his discoveries concerning scientific transmission, 
including his study of the Indian adoption of Hellenistic astrology, his exposition of 
the dependence of medieval Spanish astronomy on Indian sources, and his research 
on the reactions to Latin heliocentrism in eighteenth-century Rajasthan, were solidly 
established by masterly textual scholarship and have substantially transformed the 
standard narrative of scientific development. 

In light of the above-mentioned controversies, Pingree has sometimes been 
rashly relegated (by those acquainted with only a few isolated fragments of his 
work) to the company of reactionary Orientalists like the nineteenth-century John 
Bentley and G. R. Kaye who took it for granted that original scientific discoveries 
were by default, Greek, and that treatises in, say, Arabic or Sanskrit or Chinese 
must represent mere borrowings and imitations. Pingree himself, however, was far 
from sharing this ill-informed view, which he scornfully dismissed as the vice of 
“Hellenophilia” unworthy of a serious historian:


Hellenophiles, it might be observed, are overwhelmingly Westerners, displaying 
the cultural myopia common in all cultures of the world but, as well, the arrogance 
that characterized the medieval Christian's recognition of his own infallibility and
that has now been inherited by our modern priests of science. . . If it is evident that 
for a historian the proposition that the Greeks invented science must be rejected, 
it necessarily follows that they did not discover a unique scientific method. . . 
Babylonian and Indian mathematics are frequently criticized for relying not on 
proofs but on demonstrations. But without axioms and without proofs Indian
mathematicians solved indeterminate equations of the second degree and discovered the
infinite power series for trigonometrical functions centuries before European
mathematicians independently reached similar results. . . Those who deny the validity
of alternative scientific methods must somehow explain how equivalent scientific 
“truths” can be arrived at without Greek methods. And in their denial they clearly
deprive themselves of an opportunity to understand science more deeply. (Pingree,
“Hellenophilia versus the history of science”, Isis 83.4 (1992), 554-563)


It should be noted that the “infinite power series” referred to above were familiar
to Pingree largely through the Sanskrit editions of the Kerala-school treatises
published by his brilliant and indefatigable colleague K. V. Sarma. The historiographic
approach that they, and the studies in the present work, espouse, namely the careful
exploration of the intellectual content of a scientific tradition within its own cultural
context and in its encounters with other cultures, is what David Pingree recognized
as the true vocation of a historian of science. 


Department of Mathematics, Union College, Schenectady NY, USA. 
K. V. Sarma (1919-2005)¹


M. S. Sriram 


Born at Chengannur in Kerala on 27th December 1919, Krishna Venkateswara Sarma
had his school education in Attingal near Thiruvananthapuram. He completed his
B.Sc. degree with Physics as the major subject in 1940, from Maharaja's College
of Science, Thiruvananthapuram. His family tradition of Sanskrit scholarship
influenced Sarma to join the M.A. course in Sanskrit at Maharaja's College of Arts,
Thiruvananthapuram, which he completed with distinction in 1942. During 1943-51,
he was in charge of the Manuscripts Section of the Kerala University Oriental
Research Institute and Manuscripts Library. It is here that he acquired expertise in 
deciphering and critically editing palm-leaf and paper manuscripts of Sanskrit and 
Malayalam texts. During this period, he prepared an analytical catalogue of nearly 
50,000 manuscripts of the library. 

From 1951 to 1962, Prof. Sarma was in the Department of Sanskrit, University of 
Madras, where he was associated with the project of compiling the New Catalogus 
Catalogorum of Sanskrit Works and Authors, under the direction of the great San- 
skritist V. Raghavan. It was also the time when his life-long pre-occupation with the 
Kerala school of Astronomy and Mathematics began to take shape and he started 
painstakingly collecting manuscripts on Astronomy, Astrology and Mathematics, 
critically editing and translating many of them. Some of his early publications in this 
genre were Grahacaranibandhana of Haridatta, Siddhantadarpana of Nilakantha,
Venvaroha of Madhava, Goladipika and Grahanastaka of Paramesvara. During
this period, Prof. Sarma also came under the influence of the renowned scholar T. S. 
Kuppanna Sastri, in collaboration with whom he edited the main text of the Vakya 
system, Vakyakarana, with the commentary of Sundararaja. 

At the invitation of Acharya Viswa Bandhu, Prof. Sarma moved in 1962 to
the Visvesvaranand Institute of Sanskrit and Indological Studies of the Panjab
University at Hoshiarpur. He served as the Director of the Institute during 1975-80
and stayed on at the Institute till 1983. This period of his stay at Hoshiarpur
was indeed very productive and he published more than 50 books, mostly on the
Kerala School of Astronomy. These include very important seminal works such as
Drgganita of Paramesvara, Golasara of Nilakantha, A History of the Kerala School
of Hindu Astronomy, Lilavati of Bhaskaracarya with Kriyakramakari of Sankara
and Narayana, Tantrasangraha of Nilakantha with the commentaries Yuktidipika
and Laghuvivrti of Sankara, Jyotirmimamsa of Nilakantha, and Ganitayuktayah.

¹ This is based on the obituary which appeared in Ind. Jour. Hist. of Sci. 41 (2006), 231-246, which
also includes a bibliography of publications of K. V. Sarma.

In 1983, Prof. Sarma returned to South India to settle down in Chennai. His 
important publications during this period include: Indian Astronomy: A Source
Book jointly with the renowned historian of science B. V. Subbarayappa, Vedanga
Jyotisa of Lagadha and Pancasiddhantika of Varahamihira, on which he had worked
in collaboration with T. S. Kuppanna Sastri. From 1990 onwards, Prof. Sarma
had been working on a critical edition and English translation of the celebrated
Malayalam work Ganita-yukti-bhasa of Jyesthadeva (c.1530 AD). He requested
K. Ramasubramanian, M. D. Srinivas and M. S. Sriram to prepare detailed explanatory
notes in English. Prof. Sarma also edited a Sanskrit version of Ganita-yukti-bhasa
which appeared in 2004, and compiled an important catalogue, Science
Texts in Sanskrit in the Manuscripts Repositories of Kerala and Tamil Nadu, 
which includes a list of nearly 3,500 works related to science and technology. 
In fact, he continued to be relentlessly active till his very death (on January 13, 
2005). 

Prof. Sarma has to his credit several publications also on diverse aspects of 
Sanskrit learning such as Vedas, Itihasas and Puranas, Dharmasastras, etc. In fact,
he has authored more than 100 books and 500 articles. His outstanding contribution 
consists in searching for and bringing to light many of the seminal works of Kerala 
School of Astronomy, which show that the tradition of Mathematics and Astronomy 
continued to flourish till late middle ages at least in the South of India. They also 
present a detailed view of the methodology of these sciences, on issues such as 
justification of mathematical and astronomical results and procedures, and the 
importance of continuous examination and revision of planetary theories. It is 
mainly due to Prof. Sarma’s painstaking work on primary sources that the work of 
the Kerala School has been brought to the attention of historians of Mathematics, 
and opened a new perspective on Indian contributions during the late medieval 
period. 

Just as in the case of his illustrious predecessors such as Bibhutibhushan Datta, 
Avadhesh Narayan Singh and others, and his own contemporary and collaborator 
K. S. Shukla, Prof. Sarma did not receive, even in his own country, the recognition
and accolades which he richly deserved. He was of course awarded the D.Litt. degree
of Panjab University in 1977 and, in 1992, was bestowed the Certificate of Honour 
by the President of India. He was also conferred the honorary degree of Vacaspati 
by Kendriya Samskrita Vidyapeetham, Tirupati, in 2003. 

Prof. Sarma, with the wish that his legacy should continue, founded the Sree 
Sarada Education Society & Research Centre during the 90s, donating his lifetime 
savings and invaluable collection of books and manuscripts; since his demise, 
the Centre has published critical editions of some of the texts that he had been
working on during his last days. One hopes that the Centre will receive support and 
encouragement from scholars and funding agencies so as to sustain the torch that 
Prof. Sarma lit. 


Department of Theoretical Physics, University of Madras. 
Geometry in the Sulvasutras


S. G. Dani 


Sulvasutras are compositions pertaining to the fire rituals performed by the Vedic Indians. The
rituals involved constructions of altars and fireplaces in a variety of shapes, involving geometric
theory. Some of the theory is explicitly enunciated, while some other aspects of the knowledge at
that time can be inferred from the constructions. We present here an overview of the geometric
ideas contained in the Sulvasutras.


Yajnas, or fire rituals, formed an integral part of life in the Vedic culture, going 
back to 1500 BCE or earlier, and extending until about the sixth century BCE. 
Some of these concerned sacrifices to be performed regularly by a householder 
(grhastha), while performance of certain others was prescribed for bringing about 
fulfilment of specific aims or desires, which included both material (acquiring 
cows, vanquishing an enemy etc.) and transcendental (securing place in heaven) 
aspects. 

In view of the great significance attached to the yajnas, meticulous attention was
paid to a variety of details in their planning and execution. The rituals involved
construction of altars (vedi) and fireplaces (agni) in a variety of intricate shapes.
Over a period the procedures for construction of the shapes appear to have become
more formalised and acquired a degree of sophistication in geometrical terms. The
Śulvasūtras mark the peak in the geometrisation of the altar building activity of the
Vedic era.

The Śulvasūtras are often referred to as ‘manuals’ for construction of the altars
and fireplaces. While there is a certain valid analogy here, it should be borne in
mind however that the contents are not limited to prescribing steps or procedures
for the construction of the altars and fireplaces. They also describe various geometric
principles involved, and set up a body of geometric ideas. In the Baudhayana
and Apastamba Śulvasūtras there are separate sections devoted to geometric
theory.

The Vedic people were a heterogeneous community, with many śākhās
(branches), having nevertheless a common cultural identity. The different śākhās
had their versions of Śulvasūtras, transmitted orally from generation to generation
within the community (branch). There are nine extant Śulvasūtras, of which
four, the Baudhayana, Apastamba, Manava and Katyayana Śulvasūtras, are of significance
from a mathematical point of view. The dates of the Śulvasūtras are uncertain,
but it is generally believed that they were composed sometime during the
period 800-200 BCE; for the individual Śulvasūtras the ranges would be
Baudhayana (800-500 BCE), Apastamba and Manava (650-300 BCE) and
Katyayana (300 BCE-400 CE), according to Kashikar, as quoted in [11].

The root sulv means ‘to measure’, and the name Śulvasūtra would correspond
to “theory of mensuration” (see [4]). The word sulva also means ‘rope’ in Sanskrit,
and the rope was indeed a major piece of equipment employed in measurements. In the body of
the Śulvasūtras the word sulva does not appear; instead the word rajju is used for
rope. On the other hand, while most measurements involved in the constructions
were indeed carried out with ropes, there are instances where a bamboo rod was
used instead. This suggests that the name Śulvasūtras indeed was meant to convey
mensuration in the conceptual sense, and not only as operations with the rope. The
meaning of sulva as rope is presumably a later development.

Like other Vedāṅgas (appendages of the Vedas) the Śulvasūtras are composed in
the sūtra (aphoristic) style, characterised by short sentences with nouns often compounded
at great length and verbs avoided as much as possible, rather than running
prose, presumably for reasons of convenience in reciting them. The text, which is in
prose form in other respects, has been divided by later commentators into convenient
segments, treated as individual sūtras, and grouped into Chapters. As presented in
[19] (the scheme which we shall follow in the sequel for reference), Baudhayana has
21 Chapters adding to 285 sūtras, Apastamba has 21 Chapters adding to 202 sūtras,
Manava has 16 Chapters adding to 228 sūtras, and Katyayana has 6 Chapters adding
to 67 sūtras.

There is a considerable overlap in the contents of the different Sulvasütras in- 
dicating that the works are expositions from a common stream of knowledge. 
There are also significant differences, which may be attributed to the different 
branches they come from and the difference in their period. For general refer- 
ence in this respect the reader is referred to [19]; (see also [1], [6], [13], [14] 
and [16]). 

The contents of the Śulvasūtras can be broadly categorised into two groups,
one consisting of geometric theory, and another dealing with various details about
the constructions of various vedis and agnis; in Baudhayana and Apastamba the
sections dealing with geometric theory are arranged in the beginning. From the other
part also one can draw some inferences about the geometric knowledge at that
time. The preponderant aspect in this part however is the description of the
‘architecture’ of tilings involved in the construction of complex figures needed for
the vedis and agnis. (The Katyayana Śulvasūtra however consists mostly of the theory
part; this also explains its being shorter than the others.) The designs of some
of the special fireplaces involve elaborate figures resembling falcons and other
birds, tortoise, chariot wheels, circular trough (with a handle), pyre, etc., whose
constructions are described, quite elaborately, in the form of tiling by bricks in
certain primary shapes. Apart from the issue of achieving likeness with the desired
figure, in terms of rectilinear constructions, there are also other stipulations
involved, such as the number of bricks to be used. On account of these there
are also some arithmetical and combinatorial features involved, in a rather scattered
form, in the architectural description of the layout of the fire altars. A study of
this aspect would be of interest. We shall however not concern ourselves with it
here, and will confine ourselves to geometry in the Śulvasūtras, including the principles and
constructions described explicitly, as well as those which can be seen to be involved
implicitly.

The geometric contents from the Śulvasūtras will be discussed in the following
sections taking up various themes. Before going over to the main contents, a few
words would be in order regarding the units of measurement involved. They had
various units for measures of lengths. Measures of many of the altars are given
in terms of purusa (meaning man), which was about 7½ feet, stipulated as the
height attained by the performer of the sacrifice, yajamana, with uplifted arms.
A commonly occurring small unit is aṅgula (meaning finger, in width); a purusa
comprised 120 aṅgulas, so an aṅgula was about ¾ of an inch. Pada (meaning
foot, given as 15 aṅgulas in Baudhayana and 12 aṅgulas in Katyayana), pradesa
(12 aṅgulas), aratni (24 aṅgulas) are some of the other length measures that occur
frequently.

The Baudhayana Śulvasūtra gives in the beginning (sūtra 1.3) names of 18
different units of length measure. The smallest among them, tila (sesame seed),
is 1/34 of an aṅgula. It was postulated by Thibaut (see [20], page 15) that the unit
owes its origin to the fact that they had a formula for $\sqrt{2}$ involving the fraction 1/34
(the formula will be discussed later). Many of the intermediate units do not bear a
simple fractional relation with purusa however; e.g. a bahu is 36 aṅgulas, a yuga
is 86 aṅgulas, etc. The units must have arisen from the context of performance of
specific vedis and many of them occur infrequently.
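
The relations between the units quoted above can be tabulated in a few lines. The following is only an illustrative sketch: it assumes the tila is 1/34 of an aṅgula and takes the aṅgula as exactly 3/4 inch (the text gives this only approximately); the dictionary and function names are hypothetical.

```python
from fractions import Fraction

# Lengths in angulas, as quoted in the text.
UNITS_IN_ANGULAS = {
    "tila": Fraction(1, 34),
    "angula": Fraction(1),
    "pradesa": Fraction(12),
    "pada (Baudhayana)": Fraction(15),
    "aratni": Fraction(24),
    "bahu": Fraction(36),
    "yuga": Fraction(86),
    "purusa": Fraction(120),
}

# Assumption: one angula is taken as exactly 3/4 inch (the text says "about").
INCHES_PER_ANGULA = Fraction(3, 4)


def in_feet(unit):
    """Length of the unit in feet under the 3/4-inch assumption."""
    return UNITS_IN_ANGULAS[unit] * INCHES_PER_ANGULA / 12


def as_fraction_of_purusa(unit):
    """The unit expressed as an exact fraction of a purusa."""
    return UNITS_IN_ANGULAS[unit] / UNITS_IN_ANGULAS["purusa"]


for name in UNITS_IN_ANGULAS:
    print(name, float(in_feet(name)), as_fraction_of_purusa(name))
```

Under these assumptions the purusa comes to 7.5 feet, while a yuga is 43/60 of a purusa, which illustrates the remark that some intermediate units bear no simple fractional relation to the purusa.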

The other Śulvasūtras use many of the units described by Baudhayana but there
is no systematic listing or a comprehensive statement on their interrelations as
in the Baudhayana Śulvasūtra. Some other measures, vitasti, ūrvasthi, anūka, are also
mentioned in the Apastamba Śulvasūtra.

For the area of rectilinear figures they had the notion as we have today. They
were aware that for similar figures the ratio of the areas equals the square of the
ratio of the lengths of the corresponding sides, as is clear from usage of the idea
at various places; in the Apastamba Śulvasūtra there is also an elucidation of this with
some examples, including with fractional sides 1½ and 2½ (sūtras 3.6 to 3.9). The
square unit and the corresponding linear unit were known by the same name, the
meaning being understood from the context; e.g. purusa could mean the unit of
length as well as the area of the square with that length (“square purusa”, so to
speak), depending on whether it referred to length or area; we shall also adopt this
as a convention in the sequel when referring to the Śulvasūtra units, rather than
prefixing the term “square”.


1. Construction of Rectilinear Figures 


Though inevitably there are some 3-dimensional features to the fireplaces described
in the Śulvasūtras, the geometric ideas of significance chiefly concern planar geometry.
These involve the concepts and construction of rectilinear figures such as
squares, rectangles, symmetric (isosceles) trapezia and triangles, rhombuses, as
well as circles, as primary figures.

The rectilinear figures sought to be drawn had a bilateral symmetry; viz. isosceles 
triangles, symmetric trapezia, rectangles. The east-west line served as the line of 
symmetry. Towards construction of these figures with prescribed sizes for the sides, 
the sutras principally describe steps to draw perpendiculars to the line of symmetry; 
these are however packaged into complete procedures for drawing the desired 
figures, as may be seen in some examples discussed below. The issue of drawing a 
perpendicular to a given line at a given point on it is addressed in the Sulvasütras in 
two essentially different ways, involving the following principles (described here 
in modern formulation): 


• given two circles with equal radii intersecting each other, the line joining their
(two) points of intersection is perpendicular to the line joining their centres,
at the midpoint of the line segment joining the centres.


• (converse of Pythagoras theorem) in a triangle with sides with lengths a, b, c,
if $c^2 = a^2 + b^2$ then the sides with lengths a and b are perpendicular to each
other.


These principles are not enunciated in the Śulvasūtras, though they are implicitly
at work in their constructions (see however the discussion at the end of §2). A large
number of sūtras describe constructions of squares and trapezia via application of
one of the above, for various specific given sizes, as we shall see in some detail
below.

The first statement as above is of course what we commonly use for drawing 
perpendiculars, to a given line at a given point, in Euclidean geometry: take two 
points equidistant from the given point and draw two intersecting circles with these 
as centres and join the points of intersection. This procedure is involved in various 
constructions in Sulvasütras for drawing perpendiculars in the same way as we now 
do. It is however not isolated as a procedure for drawing perpendiculars, but forms a 
part of the package prescribed for construction of various figures (in the framework 
as indicated above): thus a construction of a square in Baudhayana (sūtras 1.22 to
1.28, [19]) consists of the following steps, described as an aggregate (see Figure 1):


i) take a rope of the desired side of the square and mark the midpoint;
ii) place a pole at the desired midpoint (say at P as in Figure 1a) and, tying the
ends of the rope to it, draw the circle around it by the mark (at the midpoint)
and place poles at the points where the circle meets the east-west line (A
and B as in Figure 1);


Figures la and 1b: Beginning of Baudhayana construction of a square 


iii) draw circles with centres at points A and B by the length of the rope, and 
mark the points where the line joining their points of intersection meets the 
original circle, to obtain the north-south line and mark the points where the 
line meets the circle as in (ii) (C and D as in Figure 1b); 


iv) draw circles by the midpoint on the rope, tying both its ends at A and then
at D, and mark the point where they meet (Q as in Figure 1d); this is one of
the vertices of the square, and the others can be obtained similarly.


Notice that the first three steps are designed to draw the perpendicular bisector to 
the line of symmetry and marking the midpoints of the sides of the desired square, 
and in (iv) these are used to produce the vertices of the square. 
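
The four steps can be traced in coordinates. The sketch below is a modern illustration under stated assumptions and is not a procedure found in the text: P is placed at the origin, the east-west line is taken as the x-axis, and the function names are hypothetical.

```python
import math


def circle_intersections(c1, r1, c2, r2):
    """Both intersection points of two circles that are assumed to intersect."""
    (x1, y1), (x2, y2) = c1, c2
    d = math.hypot(x2 - x1, y2 - y1)
    t = (d * d + r1 * r1 - r2 * r2) / (2 * d)   # from centre c1 to the common chord
    h = math.sqrt(max(r1 * r1 - t * t, 0.0))    # half-length of the common chord
    fx, fy = x1 + t * (x2 - x1) / d, y1 + t * (y2 - y1) / d
    ux, uy = -(y2 - y1) / d, (x2 - x1) / d      # unit vector along the chord
    return (fx + h * ux, fy + h * uy), (fx - h * ux, fy - h * uy)


def baudhayana_square(a):
    """Vertices of the square of side a centred at P = (0, 0)."""
    P = (0.0, 0.0)
    # (ii) the circle of radius a/2 about P meets the east-west line at A and B
    A, B = (a / 2, 0.0), (-a / 2, 0.0)
    # (iii) circles of radius a about A and B; their common chord is the
    # north-south line, which meets the first circle at C and D
    N, S = circle_intersections(A, a, B, a)
    scale = (a / 2) / math.hypot(N[0], N[1])
    C, D = (N[0] * scale, N[1] * scale), (S[0] * scale, S[1] * scale)
    # (iv) circles of radius a/2 about adjacent midpoints meet at P and at a vertex
    vertices = []
    for M1, M2 in [(A, C), (C, B), (B, D), (D, A)]:
        q1, q2 = circle_intersections(M1, a / 2, M2, a / 2)
        vertices.append(max(q1, q2, key=lambda q: math.dist(q, P)))
    return vertices


print(baudhayana_square(2.0))
```

For a = 2 the four points returned are, up to rounding, (±1, ±1), the corners of a square of side 2 whose sides have A, B, C and D as midpoints.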

Other constructions are also described in a similar vein, each as a package for producing
the intended figure, without reference to the steps they have in common.

The procedure for drawing the circles involved in the above construction is by 
tying one end of a rope to a pole placed at the point chosen to be the centre and 
tracking the point at a distance equal to the selected radius; this plays the role of the 
compass as used in school geometry now. 

Figures 1c and 1d: Completion of construction of the square


The same principle as above was also used with a rope in another way (see
Figure 2): given a line and a point on it, say P, to draw the perpendicular to the line
at P, take two points on the line equidistant from P, on either side of P, affix
poles at the two points and tie a rope (loosely) at the two poles; then stretching the
rope holding at its midpoint, points are marked on either side where it lies on the
ground; the line joining these two points is the desired perpendicular to the original
line at P.

It may be seen that though the procedure is different the same principle underlies 
this construction. This variation for producing perpendiculars also appears as part 
of construction of some rectilinear figures; in particular Baudhayana construction 
of a rectangle given in sutras 1.36 to 1.41 adopts this procedure. 

Though these methods based on the orthogonality principle as above have been
used in various constructions, on the whole the sūtrakāras show greater predilection
towards using the other principle, namely the converse of Pythagoras theorem. It
has been believed that the Egyptians also used triangles with sides 3, 4 and 5 (in some
units) for construction of perpendiculars, but this has now been discounted (see [8],
Appendix 5). It is not clear whether there is any other instance historically of using
the converse of Pythagoras theorem for the purpose of drawing perpendiculars.

Let me now describe in some detail how this application was made, and discuss
the possible convenience for which it was preferred. Take a rope, with endpoints
marked P and Q, with length a (see Figure 3). Let R be the midpoint of PQ (on the
string), S the midpoint of QR and X the midpoint of RS. Now tie the two ends of
the string to two poles placed a distance a/2 apart along the line to which a perpendicular
is to be drawn, with the Q end at the point where the perpendicular is to be drawn.
Let A and B be the points on the plane (ground) where the ends P and Q are tied.
Now stretch the rope, holding it at the point X as above, on one side of the line AB,
and mark the point on the plane where X lies, say C (see Figure 3(c)). Notice that
B and C are at a distance 3a/8 and A and C are at a distance 5a/8. Thus the sides
BC, AB and AC are in the proportion 3 : 4 : 5, and since $3^2 + 4^2 = 5^2$, by the
converse of the Pythagoras theorem the angle ∠ABC is a right angle. We have thus
constructed a perpendicular to the line AB at the point B.
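
The arithmetic of the markings can be checked with exact fractions. This is a minimal sketch that measures positions along the rope from the end Q; the function name is hypothetical.

```python
from fractions import Fraction


def nyanchana_345(a):
    """Sides (BC, AB, AC) of the triangle produced by a rope of length a."""
    a = Fraction(a)
    Q, P = Fraction(0), a   # positions of the two ends along the rope
    R = (P + Q) / 2         # midpoint of PQ
    S = (Q + R) / 2         # midpoint of QR
    X = (R + S) / 2         # midpoint of RS: the mark held while stretching
    BC = X - Q              # rope between the pole B (end Q) and the mark
    AC = P - X              # rope between the pole A (end P) and the mark
    AB = a / 2              # distance between the two poles
    return BC, AB, AC


BC, AB, AC = nyanchana_345(8)
print(BC, AB, AC)                    # 3 4 5
print(BC ** 2 + AB ** 2 == AC ** 2)  # True
```

With a = 8 the three sides come out as 3, 4 and 5, and the same proportion holds for any length a since every quantity scales with it.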


Figures 3a, 3b and 3c: Construction of perpendicular by the Nyanchana method


A rope with markings as above, which can be preserved, can thus be used as an
instrument to draw a perpendicular, essentially at one stroke; the distance a/2 to 
be kept between A and B is also available by a mark on the rope as the distance
between P and R. In directing a yajamana towards drawing a perpendicular (as a
step in the construction of the vedi), it would be simpler for the priest to use this 
approach, than those with the orthogonality principle discussed above. 

The above procedure depends on the fact that a triangle with sides in the pro- 
portion 3 : 4 : 5 is a right angled triangle, by the converse of the Pythagoras 
theorem. The Sulvasütras describe also analogous procedures using in place of 
(3, 4, 5) other triples (a, b, c) such that $a^2 + b^2 = c^2$; typically they are triples of
integers, that we now call Pythagorean triples, and their multiples by a fraction, 
but occasionally some incommensurable triples are also involved. As in the above 
procedure it involves marking a point X so that when the rope is stretched holding 
at that point we would get a right angled triangle. Such a point is called Nyanchana; 
Nyanchana means “lying with face downwards” and in this context signifies that 
the marked point is to be plotted on the ground. A procedure involving use of the 
triple (5, 12, 13) goes as follows: having chosen a distance a between the poles, a
rope of length one and a half times the measure is taken (thus extending the rope by
a/2), and the Nyanchana mark is set at a distance of a sixth of the extended piece,
namely a/12 from the joining point. The Nyanchana mark then divides the string
in the proportion 5 : 13, and steps analogous to those described above will yield a 
perpendicular at the pole on the side of the shorter segment. It may be noted that the 
Pythagorean triple is unrelated to the length a in either case. 
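
Both procedures fit one pattern: for a triple (p, q, r) with the poles q units apart, the rope has length p + r units and the mark divides it in the proportion p : r. The following sketch makes that explicit; the general formulation and the function name are my own illustration, not something stated in the Śulvasūtras.

```python
from fractions import Fraction


def nyanchana_rope(pole_distance, triple):
    """Rope length and segments for poles `pole_distance` apart, using (p, q, r)."""
    p, q, r = triple
    if p * p + q * q != r * r:
        raise ValueError("not a triple with p^2 + q^2 = r^2")
    unit = Fraction(pole_distance) / q   # the pole distance corresponds to q
    short, long_ = p * unit, r * unit    # the two segments cut off by the mark
    return {
        "rope_length": short + long_,
        "short_segment": short,
        "long_segment": long_,
    }


print(nyanchana_rope(12, (5, 12, 13)))
print(nyanchana_rope(4, (3, 4, 5)))
```

For (5, 12, 13) and poles 12 units apart the rope is 18 units long, one and a half times the pole distance, and the longer segment exceeds the pole distance by 1 unit, that is a/12. For (3, 4, 5) the rope is twice the pole distance, as in the first procedure.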

In [3] I have discussed the theme of Pythagorean triples with regard to the 
Sulvasütras. Here I will therefore introduce it only briefly to put the topic in per- 
spective. 

The main role of the Pythagorean triples in the Śulvasūtras was their use in producing
perpendiculars via the converse of Pythagoras theorem. The two triples (3, 4, 5) and
(5, 12, 13) are a common occurrence in this respect in the Śulvasūtras. These are
primitive triples (there is no common integer factor greater than 1). Some multiples
of these triples (non-primitive) were also commonly in use; the triple (15, 36, 39)
seems to have been an especially familiar one, and perhaps much older than the
Śulvasūtras themselves (see [3] for some observations on this). In Apastamba two
more primitive Pythagorean triples occur in the description of the construction
of the Mahavedi: (8, 15, 17) and (12, 35, 37) (Asl. 5.3 - 5.5); this is the only place
where they occur in the Apastamba Śulvasūtra.

The Mahavedi was in the shape of a symmetric trapezium with a base of 30 units,
height of 36 units and face (side opposite to the base) of 24 units (see Figure 4); to
give an idea of the physical size (though it shall not concern us further) it may be
mentioned that the unit involved is either a pada or a prakrama, the latter being one-fourth
of a purusa, and the height then works out to be about 20 meters.

Figure 4: The Mahavedi


Apastamba gives four constructions for the Mahavedi, all based on the Nyanchana
method as discussed above. The first one, known as the ekarajjuvidhi (“one-rope
process”), involves a rope of length 54 units, with markings at 36 and 12 units
from the two ends respectively and a mark at the midpoint of the remaining middle
portion of 6 units as the Nyanchana mark. Thus we have a subdivision into 15 and
39 units, and the desired height being 36 units this enables fixing the two vertices A
and B at the base by the Nyanchana method (through triangles EWA and EWB).
The vertices at the face CD are also obtained in the same way, by plotting the mark
at 12 from the end rather than the original Nyanchana mark. The other constructions
involve two cords.
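
The markings on the single rope can be verified directly. The sketch below covers only the base vertices; the coordinates (W at the origin, E at distance 36 along the east-west line) and the function name are assumptions made for illustration.

```python
def ekarajju_markings(rope=54, from_one_end=36, from_other_end=12):
    """Position of the Nyanchana mark and the two segments it cuts off."""
    middle = rope - from_one_end - from_other_end   # the remaining middle portion
    nyanchana = from_one_end + middle // 2          # its midpoint, from the first end
    return middle, nyanchana, rope - nyanchana


middle, long_part, short_part = ekarajju_markings()
print(middle, long_part, short_part)   # 6 39 15

# Assumed coordinates: W = (0, 0), E = (36, 0); A and B lie on the base through W.
W, E = (0, 0), (36, 0)
A, B = (0, short_part), (0, -short_part)


def dist_sq(u, v):
    """Squared distance, kept in integers so that the check is exact."""
    return (u[0] - v[0]) ** 2 + (u[1] - v[1]) ** 2


# The rope segments of 15 and 39 reach A from W and from E respectively.
print(dist_sq(W, A) == short_part ** 2, dist_sq(E, A) == long_part ** 2)
```

The check confirms that 15, 36 and 39 satisfy the Pythagorean relation, so A and B lie on the perpendicular to the east-west line at W, 30 units apart.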

It may be noticed that the diagonals of the trapezium meet at a point M which
together with the base and face makes isosceles triangles AMB and CMD whose
symmetric half parts have lengths given by the triples (15, 20, 25) and (12, 16, 20)
respectively. Apastamba's second procedure consists of drawing the two triangles
using that these are multiples of the triple (3, 4, 5).
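
The position of M and the two triples follow from the dimensions of the Mahavedi. The sketch below uses the similarity of the triangles AMB and CMD, a modern shortcut that is not attributed to the text; the function names are hypothetical.

```python
from fractions import Fraction
from math import isqrt


def diagonals_meet_at(base, face, height):
    """Distance from the base to the meeting point M of the diagonals."""
    # The triangles AMB and CMD are similar, so M divides the height
    # in the ratio base : face.
    return Fraction(height * base, base + face)


def half_triangle(half_side, distance):
    """Integer sides of the right triangle on half a parallel side, up to M."""
    hyp_sq = half_side ** 2 + distance ** 2
    hyp = isqrt(int(hyp_sq))
    if hyp * hyp != hyp_sq:
        raise ValueError("hypotenuse is not an integer")
    return (half_side, int(distance), hyp)


m = diagonals_meet_at(30, 24, 36)
print(m)                          # 20
print(half_triangle(15, m))       # (15, 20, 25)
print(half_triangle(12, 36 - m))  # (12, 16, 20)
```

M lies 20 units from the base and 16 from the face, and both triples are multiples of (3, 4, 5), by 5 and by 4 respectively.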

The third construction is based on the triple (5, 12, 13) and its multiple 
(15, 36, 39), using that half the base and half the face, viz. 15 and 12, occur in 
these triples; unlike in the first two constructions one of the vertices of the right 
angled triangle involved (the one on the east-west line) is of no significance to 
the diagram itself. It is the last construction in which the triples (8, 15, 17) and 
(12, 35, 37) appear. Again since 15 and 12 occur in these, the vertices at the base 
and the face may be plotted with these triples, respectively, with poles at distances 
of 8 and 35 from the base and face respectively. Manifestly the construction would 
no longer be as elegant as the earlier ones, and Apastamba could not have missed 
noticing that. From the overall context it is clear that the aim has been to quench 
the curiosity about the various ways the Nyanchana method could be used to plot 
the vertices, and no practical requirement is involved. This involves finding triples
with 15 and 12 as one of the first two entries. Incidentally, Apastamba exhausts
such triples. There is no clue however whether this was known, and if so how it was
realised.

In Baudhayana, while only the first two primitive triples, viz. (3, 4, 5) and
(5, 12, 13), are found used in the constructions, there is a list of 5 primitive
Pythagorean triples (or rather the first two terms of each, which of course determine
the third term) given, following the statement of Pythagoras theorem.
Apart from the four triples as in the above discussion the triple (7, 24, 25) also
forms part of the list; the (non-primitive) triple (15, 36, 39) is also included along
with the others, which is possibly due to the familiarity with it in a wider context,
and its association with tradition on account of its being involved in the Mahavedi
(see [3] for a discussion on this). The wording of the sūtra, its location and the
overall context indicate that the listed triples are given as illustrative examples for
the Pythagoras theorem.¹

There are no other primitive Pythagorean triples found in the Śulvasūtras. It
would seem that though they may have had the means of producing more triples, if not
an infinite family, they would have had no motivation for it, with the ones that occur
having specific objectives that are adequately met in their context.
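
The triples mentioned in this section can be completed from their first two terms and tested for primitivity. In this small sketch the order of the pairs is only illustrative and the function names are hypothetical.

```python
from math import gcd, isqrt

# First two terms of the triples discussed above (order is illustrative).
LISTED_PAIRS = [(3, 4), (5, 12), (8, 15), (7, 24), (12, 35), (15, 36)]


def complete(a, b):
    """Return (a, b, c) with a^2 + b^2 = c^2, or raise if c is not an integer."""
    c = isqrt(a * a + b * b)
    if c * c != a * a + b * b:
        raise ValueError("the pair does not determine an integer third term")
    return (a, b, c)


def is_primitive(triple):
    """True when the three terms have no common factor greater than 1."""
    a, b, c = triple
    return gcd(gcd(a, b), c) == 1


for pair in LISTED_PAIRS:
    t = complete(*pair)
    print(t, "primitive" if is_primitive(t) else "non-primitive")
```

Five of the six come out primitive; (15, 36, 39) is three times (5, 12, 13).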

The Nyanchana procedure was also used in a construction with an incommen- 
surable triple (not a multiple of Pythagorean triples); the construction is also in- 
teresting from another point of view (see below). To construct a square of side a 
Apastamba gives the following procedure: on a rope mark three points, two end- 
points and a point in between whose distance from one endpoint is a/2, and from 
the other endpoint it is equal to the length of the diagonal of the square with side 
a/2. The rope is now tied to two poles, one at the desired centre of the square 
and the other at the point at distance a/2 on the east-west line, by the endpoints 
as above, with the longer side (from the in between mark) being attached to the 
centre. The rope is then stretched, holding it at the middle marked point. Where it
lies on the ground is one of the vertices of the desired square; the other vertices
are plotted similarly. The procedure presupposes being able to mark a point at a
distance equal to the diagonal of a square of side a/2, so the construction is in a way
“circular”, from the point of view of logical development. However, the diagonal
of a square was such a common occurrence in their practice that it was, I suppose,
treated as a “tangible” quantity. Having once constructed a square one could produce
ropes with markings as required, and then employ them for later construction of
squares following the above procedure. It has been suggested by some authors (see
Footnote 4) that an approximate numerical expression for $\sqrt{2}$ was used to get the
diagonal of the square with side a/2; this however seems unlikely, considering the
common usage of ropes all around, which furthermore would give a more accurate
measure, without the cumbersome subdivision that would be involved in producing
the approximate value using the formula for $\sqrt{2}$ (see §6).


¹ Indeed, a Pythagorean triple does not, strictly speaking, illustrate Pythagoras theorem, since one
would need to know that the triangle corresponding to the triple as side-lengths is a right angled triangle;
the nuance of the illustration here would be more like “when you draw rectangles with sides 3 and 4, 5
and 12, . . . , the diagonals will be 5, 13, . . . , and you see that the area produced by the diagonal is the
sum of those of the squares on the sides.”

For the constructions of the figures the Sulvasütras adopted meticulous proce- 
dures, involving drawing the perpendiculars, which was accomplished by the meth- 
ods that we discussed above. Surprisingly, despite extensive use of these methods, 
conceptualisation of the perpendicular or right angle seems to have eluded them.
Absence of the concept may have led them to treat the task of drawing each symmetric
trapezium with different given dimensions individually, as is noticeable especially in
the Apastamba Śulvasūtra. While there is a degree of unity in the descriptions, with the
concept of the perpendicular a more uniform prescription could have been given for
the constructions. Furthermore in several constructions, including in the Mahavedi
as seen above, special Pythagorean triples (not necessarily primitive) were sought
depending on the desired sizes. While the latter may have offered an amusing di- 
version, in practice it would have been simpler to have a unified way of drawing 
perpendiculars, even with Nyanchana method if that was found more convenient, 
and marking the point on the perpendicular line at the desired distance, producing 
the line if necessary. This possibility does seem to have been realised at some stage. 
In [3] I have noted that the vedis described in the Asl. 6.3-6.4 (nirūḍhapaśubandha
vedi), Asl. 6.6, Asl. 6.7 (paitrki vedi), Asl. 6.8 (uttara vedi), and Asl. 7.1 involve
varied shapes (two trapezia of different dimensions, two squares and an oblong rec- 
tangle), but the construction of each of them refers to “Having stretched (the cord) 
by the mark at fifteen" (pancadasikenaivapayamya), and taking various contextual 
factors into account concluded that the phrase is used as a way of saying "Having 
drawn a perpendicular"; the desired point on the perpendicular line is meant to 
be marked on that line by measuring out the requisite distance. This marks a step 
towards conceptualisation of the perpendicular at a practical level, which however 
does not seem to have been abstracted further. 


2. Pythagoras Theorem and its Applications 


The most notable feature of the Sulvasütras in terms of geometric theory is the 
statement of the so called Pythagoras theorem. This stands out especially in the 
context of the fact that some of them, especially Baudhayana, predate Pythagoras. 
There has been a variety of speculation in this respect, including that Pythagoras 
may have got it from the Indians (A. Bürk quoted in [10]) or, in broader terms, 
that there may have been a common source for the geometry of the Greeks and the 
Indians (see Seidenberg [17] and [18]). Available inputs seem inadequate to have a 
meaningful discussion on this, and in any case we will not go into this aspect here. 
The main discussion below will pertain to the role of the theorem in the overall 
context of the Śulvasūtras themselves.

The Śulvasūtras do not contain proofs, and there are no indications of how the
statements were arrived at; in this respect the analogy with manuals is pertinent. 
The idea of proof in the sense of Euclidean geometry does not really apply to 
the context of the Sulvasütras, since they were essentially concerned only with 
the practical aspects, and at best one can ask how they would have reached the 
individual conclusions, say in terms of practical geometry. While in the case of 
many statements the knowledge may be attributed to geometric intuition, it is 
difficult to see how they may have come to the statement of something like the 
Pythagoras theorem. We will discuss some of the possibilities that have been pro- 
posed in this respect. At the end of the section I propose a new hypothesis on 
how the Pythagoras theorem may have been thought about in the context of the 
Sulvasütras. 

Neither the notion of a right angle nor that of a right angled triangle is found in
the Śulvasūtras, as a concept; of course right angled triangles appeared as parts
of various figures, and were implicit in the Nyanchana operations, but were not
identified separately. Thus the statement of the Pythagoras theorem occurs not with
respect to right angled triangles, but rather with reference to rectangles. A close
translation of how it is stated in Baudhayana would be “the diagonal of a rectangle 
makes as much (area) as the (areas) made separately by the base and the side put 
together". An identical statement is found in Apastamba. The statement appears 
also in Katyayana where it is followed by a clause “iti ksetrajnanam", meaning 
“this is the knowledge about areas”. In Manava however an equivalent statement
occurs in quite a different form. A close translation would be “multiply the stretch
by the stretch and the width by the width; the square root of the sum is how much
the diagonal is”. The difference is striking. While the former presents the statement
about areas, the latter describes it as an algorithm for computing the length of the
diagonal. Together with the fact that many geometric constructions described in the
other Śulvasūtras are missing from Manava, this suggests a shift from a geometric
approach to either an algorithmic or a functional approach. We shall however see later
that Manava also contains some interesting original geometric statements not found 
in the other Sulvasütras. 
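
The contrast between the two formulations can be put in modern terms as follows. This is only a sketch; the function names are hypothetical, and “stretch” and “width” follow the translation quoted above.

```python
import math


def manava_diagonal(stretch, width):
    """Manava's form: a recipe that computes the length of the diagonal."""
    return math.sqrt(stretch * stretch + width * width)


def areas_agree(base, side, diagonal):
    """Baudhayana's form: the square on the diagonal equals the two squares together."""
    return math.isclose(diagonal * diagonal, base * base + side * side)


d = manava_diagonal(12, 5)
print(d)                      # 13.0
print(areas_agree(12, 5, d))  # True
```

The first function returns a number, whereas the second only asserts a relation between areas, which is the difference in character noted in the text.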


The Pythagoras theorem is also applied in the Sulvasütras towards the following 
theoretical objectives. 


To describe a square whose area is the sum of areas of two given squares: This is
covered in two parts, one when the given squares are of equal size, another when 
they are unequal. For the former the answer is given as the diagonal of the square. 
For the latter the prescription is to mark the side of the smaller square on one of 
the sides of the larger square (from one of the vertices) and take the diagonal of 
the rectangle formed by it with the other side of the bigger square (Figure 5; PC is
the prescription for the side of the sum of the squares with sides PA and PB). The
square on this diagonal has area equal to the sum of the two given squares. This is
of course a direct application of the Pythagoras theorem.


Figure 5: Square with area equal to the sum of two given squares.


To describe a square whose area is the difference of areas of two given (unequal) 
squares: For this again one marks the side of the smaller square in the larger one 
from a corner, forming a rectangle as before. An arc is drawn with one of the 
vertices of the rectangle as the centre and the longer side as the radius, meeting the 
opposite side of the rectangle. The portion of that side that is cut out, from the vertex 
nearer the point picked as the centre, is the answer (see Figure 6; BC as in the
figure is the prescription for the side of the difference of the squares with sides PA
and PB).

Figure 6: Square with area equal to the difference of two given squares. 

It may be seen that the cut out portion together with the smaller side of the 
rectangle are sides of a rectangle whose diagonal equals the side of the bigger 
square, and hence by the Pythagoras theorem the square over it has area equal to 
the difference of the two given squares. 
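
In modern terms the two constructions reduce to one line of arithmetic each. The following Python sketch is only an illustration; the function names are made up for this note and do not come from the Sulvasütras or the cited literature.

```python
import math

def side_of_sum(a: float, b: float) -> float:
    """Diagonal of the rectangle with sides a and b (construction of Figure 5)."""
    return math.hypot(a, b)

def side_of_difference(a: float, b: float) -> float:
    """Cut-off portion in the construction of Figure 6, for a > b.

    The arc of radius a meets the opposite side of the a-by-b rectangle
    at distance sqrt(a^2 - b^2) from the nearer vertex.
    """
    if a <= b:
        raise ValueError("the first square must be the larger one")
    return math.sqrt(a * a - b * b)

print(side_of_sum(3, 4))          # 5.0
print(side_of_difference(13, 5))  # 12.0
```

The sample values use the triples (3, 4, 5) and (5, 12, 13) that occur in the Sulvasütras.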

The summing or augmentation of squares with other squares was involved in 
producing figures with the same shape as a given figure but with a multiple of its 
area. For this the procedure in the construction of the figure would be followed as 
before, with an enlarged unit. To double the area one would replace the unit by the 
diagonal of the unit square; this was referred to as dvikarani. For tripling the area one 
would replace it by the side of the square obtained by augmenting a square with 
area two with the unit square (trikarani), and so on. 

The square with area equal to the difference of two squares does not seem to have 
had any direct application in the construction of altars or fireplaces. It was however 
involved in meeting the following geometric objective, which in turn was needed 
in practice as we shall see below. 


To transform a rectangle into a square with the same area:² For this purpose, forming 
the square over the smaller side, the remaining rectangle is divided into two equal 
parts parallel to the square and the farther part is then taken off and put adjacent to 
the other side of the square (see Figure 7). 


Figure 7: Transforming a rectangle to a square 


²It may be worthwhile to note here that the problem is dealt with in Euclid's Elements in Book II, 
Proposition 14, based in turn on Book II, Proposition 5, and the procedures described by Baudhayana and 
Euclid closely correspond to each other, though the presentation differs in certain details. It would appear 
that they would have drawn the figures somewhat differently, but this is a matter of speculation. 

The original rectangle is thus transformed into a gnomon, viz. a difference of 
two squares (the squares over sides PM and LM in Figure 7). How to transform 
this into a square is already described; this is how they put it also (!): tasya nirhàra 
uktah, “resolution of this has been explained”, so states Baudhayana. 

As rectangles could thus be transformed into squares one could in turn augment 
a square with a given rectangle. This was involved in the following. Various yajnas 
were to be repeated periodically and for each successive performance, the area of 
the fireplace (agni) had to be enhanced, while maintaining its shape. In many cases 
the first fire altar had area $7\frac{1}{2}$ purusas, followed by $8\frac{1}{2}$, $9\frac{1}{2}$, and so on, and it could go 
up to $101\frac{1}{2}$ purusas.³ To construct a fire altar with an enhanced area, one would carry 
out the same procedure as before with the original unit replaced by an appropriate 
larger one. This would apply independently of the (often intricate) shape of the fire 
altar. To increase the area from $7\frac{1}{2}$ purusas to $8\frac{1}{2}$ purusas, for instance, the unit 
would have to be increased so that $\frac{2}{15}$th purusa area would be added to each purusa 
in forming the new unit. The latter could thus be obtained as the side of the square 
combining a square purusa with a rectangle of area $\frac{2}{15}$th purusa. 
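
The arithmetic behind the enlarged unit can be checked with a short Python sketch. It is a modern restatement and not a procedure from the texts; the helper name `enlarged_unit` is hypothetical.

```python
from fractions import Fraction
import math

def enlarged_unit(old_area: Fraction, new_area: Fraction):
    """Return (area added to each square purusa, new unit length in old units)."""
    extra = new_area / old_area - 1
    return extra, math.sqrt(1 + extra)

extra, unit = enlarged_unit(Fraction(15, 2), Fraction(17, 2))
print(extra)  # 2/15
print(unit)   # about 1.0646
```

An altar laid out with this unit, following the same prescription as before, keeps its shape and has its area scaled by the factor 17/15.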

It may be recalled that the converse of the Pythagoras theorem is also a simple 
consequence of the theorem itself. Use of the converse in the constructions as seen in 
the last section may also be viewed as an application of the Pythagoras theorem. 
Actually it would seem that the sūtrakāras viewed the converse as a part and parcel 
of the theorem; clubbing the converse with the direct statement is not uncommon 
in the course of intuitive thinking. Otherwise it is difficult to explain why they 
would make it a point to state the direct theorem (as is done in all the Sulvasütras) 
but not the converse that is much more used. There are some other points which 
seem to corroborate this. Following the statement of the Pythagoras theorem in 
Apastamba Sulvasütra there is a clause tabhirjneyabhiruktam viharanam. This 
translates as “the above mentioned constructions are knowable (deducible) from 
this". The constructions referred to are of squares by the Nyanchana method, using 
the triples (5, 12, 13) and (3, 4, 5) respectively, which involves the converse of the 
Pythagoras theorem. Thus the converse seems to be viewed as an integral part of 
the theorem itself, though not stated as such. Similarly, as mentioned earlier, in 
Baudhayana the statement of the Pythagoras theorem is followed by a list of the 
Pythagorean triples, and there is a connecting clause ityetasupalabdhih, translating 
as "this is obtained in". As the Pythagorean triples are more closely connected with 
the converse theorem this would also suggest that the converse was clubbed together 
with the theorem itself. 

³This is according to [19] and [11]. It however seems to this author that the original sutra translates 
as going up to $108\frac{1}{2}$ purusas, adding one purusa 101 times; this of course has no consequence to any 
mathematical discussion. 

Let me now come to the question of how the validity of the Pythagoras theorem 
may have been concluded. As there are some Pythagorean triples occurring in the 
Sulvasütras it has been suggested that observation in a few cases was extrapolated 
into a general statement (see [21]). This explanation overlooks the point that knowing 
a Pythagorean triple is not tantamount to knowing a right angled triangle (or 
rectangle); a Pythagorean triple would be of significance only after one has arrived 
at the Pythagoras theorem, and not lead to it. Datta [4] (see also [19]) has proposed 
possible geometrical arguments they may have known. The connection of the 
construction involved however seems too distant from the Sulvasütras constructions 
for the argument to be convincing. 

The following possibility does not seem to have been considered in the literature. 
From the context of the Sulvasütras we see that they faced the question as to how 
the squares add up (rather than the question the other way round, as to what is the 
size of the diagonal, or the square over it). Augmentation of squares, for the purpose 
of working out the size of the enhanced unit as explained earlier, was in all likelihood 
the motive behind the discovery of the theorem, as has been noted already by 
Seidenberg [18]. If one were to contemplate the question in the old times prior to 
the theorem being discovered, with the conviction of a mystic that there has to be a 
natural choice, it seems natural that the diagonal of the rectangle with the two sides 
of the two squares (perhaps the one emanating from the common point when the 
squares are set on the two sides of the rectangle) would pose as a natural candidate to 
give the same area;⁴ what else can it be? Experience with square tilings would also 
aid such an intuition. Having guessed the answer it could be confirmed in various 
ways; it does not even have to be only with triangles with rational sides, though the 
latter would indeed facilitate verification in some ways. 


3. Transforming a Square into a Circle 


Towards the end of the purely geometric part Baudhayana gives (in sutra I.58) a 
geometric construction to produce a circle with the same area as that of a given square. 
Essentially the same procedure is described in the other three Sulvasütras, though 
the wording in each of them varies a little, including in usage of the geometric terms. 

Finding a circle with a given area, for which the above procedure would be 
used, was involved in the construction of fire altars in the shape of a chariot wheel 
(rathacakraciti), a circular trough with a handle (dronaciti), and a rounded tortoise 
(parimandala kürmaciti). 

Given a square, the prescription for obtaining the desired circle goes as follows. 
Take half the diagonal of the square and drop it from the centre along the midriff, and 
draw the circle including a third of the part jutting out (see Figure 8; QR is $\frac{1}{3}$rd of 
QS, and PR is the prescription for the radius of the desired circle). For a square with 
side $2a$ the prescribed radius works out to be $a + \frac{1}{3}(\sqrt{2} - 1)a = (2 + \sqrt{2})a/3$. 

⁴This may be compared with realising that the resultant of two forces at a point represented by two 
vectors is given by the vector represented by the diagonal of the parallelogram formed by the two. 

Figure 8: Circling the square. 


The area of this circle for the unit square works out to be 1.0172524..., slightly 
more than that of the original square.⁵ 
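
The quoted area can be reproduced directly from the prescription. The following Python sketch is a modern check and no part of the sütra; the function name and its `fraction` parameter are assumptions made for illustration.

```python
import math

def baudhayana_radius(side: float, fraction: float = 1 / 3) -> float:
    """Half-side plus the given fraction of the part of the half-diagonal jutting out."""
    half_side = side / 2
    half_diagonal = side * math.sqrt(2) / 2
    return half_side + fraction * (half_diagonal - half_side)

area = math.pi * baudhayana_radius(1.0) ** 2
print(round(area, 7))  # 1.0172524
```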

In Baudhayana there is no further comment on the sütra. On the other hand a 
sutra in Apastamba (see the next paragraph) says in particular “as much is left out 
that much comes in”, the reference being to the portions of the square that are left 
out when the circle is drawn and those which are incorporated inside. The comment 
gives an insight into how the choice for the radius of the circle was arrived at. 
Observing that the diameter should be between the length of the side and that of 
the diagonal, they looked for a proportion of the extra part which would ensure that 
as much area is left out that much comes in. The correct proportion for this (in the 
light of modern knowledge) turns out to be 0.30993473..., and if it is to be 
approached by a simple fraction, as one of a certain number of parts, then it would 
have to be one in three. That is the number they picked, presumably from intuitive 
considerations. Incidentally, if they had chosen the proportion to be 3 out of 10 (not 
inconceivable in the context of the number 10 having acquired significance as a 
base in counting numbers) they would have got a better result; for the unit square 
they would have got a circle with area 0.99271948.... 
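
Both figures in this paragraph follow from the exact radius $1/\sqrt{\pi}$ of the circle of unit area. A minimal Python sketch, with variable names chosen only for this illustration:

```python
import math

half_side = 0.5                      # unit square
half_diagonal = math.sqrt(2) / 2
jut = half_diagonal - half_side      # the part jutting out beyond the side

exact_radius = 1 / math.sqrt(math.pi)
ideal_fraction = (exact_radius - half_side) / jut

def area_for(fraction: float) -> float:
    """Area of the circle drawn when the given fraction of the jutting part is included."""
    return math.pi * (half_side + fraction * jut) ** 2

print(ideal_fraction)   # about 0.309935
print(area_for(1 / 3))  # about 1.017252
print(area_for(3 / 10)) # about 0.992719
```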

⁵In the literature there is an inclination to talk of something like “what value of $\pi$ this amounts to”, 
if the area of the circle is to be 1 as expected (it turns out to be 3.0883118...). I find this peculiar; there 
was no concept of $\pi$ in the Sulvasütras, and they were not computing such a ratio. The comparisons 
should be in terms of the output produced, as followed above. 

The sutra from Apastamba relating to the area of the circle is followed by 
sanityamandalam and then yavaddhi yate tavadagantu. The second part means “as 
much is left out that much comes in”, which was quoted above. There has been 
a debate in the literature on what the first part stands for. One of the commentators 
Karavindasvami interprets the lines as “The circle is exactly as large as the 
square, for as much the circle falls short, so much comes in”. Another commentator 
Kapardisvami breaks up sánityà as sa anityà and asserts that the circle construction 
is an approximate one. While linguistically both the interpretations are possible 
for the first part it seems certain that Apastamba meant to state that the circle is 
exactly as large, and without that the following part would be lacking in context. 
It does not seem meaningful to say "this is an approximate construction; as much 
is left out so much comes in". The alternative interpretation is motivated by the 
call, after a passage of time, to interpret it in a way consistent with the knowledge 
that it is in fact an approximate construction; this point has been made in [20]. 
The fact that it is only approximate would I believe have been suspected in the 
times of the Sulvasütras. The construction was applied in building fire altars of the 
size of $7\frac{1}{2}$ purusas (approximately 45 square meters), to be tiled into 200 tiles. An 
error of close to 2% would have been noticeable in such a context to some of those 
directly involved with the altar building. On the other hand, the error is not big 
enough to make a definitive case about it. As I see it, Apastamba's statement is 
aimed at putting forth forcefully, in the face of doubts (murmurs?) of sceptics, the 
scholastic position of the time: that is the exact circle; as much is left out so much 
comes in. In course of time the suspicions would have gained weight, serving as 
motivation to look for an alternative construction, and one was indeed found by the 
time of Manava Sulvasütra, which we discuss below. In Katyayana however only 
the original construction is described; even though it is a later Sulvasütra, being 
in a different stream it was perhaps untouched by the development around that 
sutra. 

Manava describes another construction for the circle with the area of the given 
square (cf. sūtra 11.15 in [19] and 10.3.2.15 in [11]). While it actually happens to 
be more accurate as we shall see below, the statement is very unclear, as a result 
of which it does not seem to have been understood properly; in particular I believe 
the comment on it in [19]⁶ is unwarranted, and the algebraic presentation in [11] is 
incorrect. Let me first present my interpretation of what the sütra means, and then 
discuss the wording in the sütra and the lacunae making it unclear, that apparently 
led to the confusion over it. 

⁶The comment is: "Possibly these are not problems of quadrature of the circle. Ordinary squares are 
drawn without any mathematical significance." 

Given a square, draw a line parallel to one of the sides and dividing the square 
into one-third and two-third parts (see Figure 9). Extend the line to meet the circle 
through the vertices of the square. Now consider one of the segments of the line 
from the square to the circle and take the point at one-fifth of its length, from 
the point on the square. Draw the circle passing through this (with centre at the 
midpoint of the square); in Figure 9 QR is $\frac{1}{5}$th of QS and PR is the prescription 
for the radius of the desired circle. For a square of side length 2 the length of the 
segment between the square and the circle is seen to be $(\frac{\sqrt{17}}{3} - 1)$, so the square 
of the radius of the circle is $\{1 + \frac{1}{5}(\frac{\sqrt{17}}{3} - 1)\}^2 + \frac{1}{9}$, and this yields the area 
of the circle to be 3.9787159...; for the unit square the area of the prescribed 
circle will be 0.99467897..., a much more accurate value compared to the earlier 
one. 

Figure 9: Manava construction for circling the square. 
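
The computation for the Manava construction, as interpreted above, can be sketched in Python as follows. The coordinates (centre of the square at the origin, sides parallel to the axes) and the function name are assumptions of this sketch, not part of the sütra.

```python
import math

def manava_radius_squared(half_side: float = 1.0, fraction: float = 1 / 5) -> float:
    """Squared radius of the circle through the point at `fraction` of the segment
    from the square to the circumscribed circle, along a trisecting line."""
    offset = half_side / 3                      # distance of the trisecting line from the centre
    outer_radius = half_side * math.sqrt(2)     # circle through the vertices of the square
    to_circle = math.sqrt(outer_radius ** 2 - offset ** 2)
    x = half_side + fraction * (to_circle - half_side)
    return x ** 2 + offset ** 2

area_side_2 = math.pi * manava_radius_squared()
print(area_side_2)      # about 3.978716
print(area_side_2 / 4)  # about 0.994679 for the unit square
```

Calling `manava_radius_squared(fraction=4 / 5)` instead gives an area of roughly 1.41 for the unit square, which illustrates how far off the other reading of the sütra would be.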


The wording of the sütra however leaves a lot to be desired from the point of view 
of clarity. Firstly, rather than asking the square to be cut into three equal parallel 
strips the sütra asks it to be divided into 9 equal squares, via trisection along both the 
sides; presumably this is on account of a preoccupation with symmetry. Secondly, 
there is no mention of the outer circle. This seems to have led the authors of [19] 
to give up on the meaning of what follows, and content themselves with presenting 
only an ad hoc translation, with the comment as noted. In [11] the author has 
recognised the unmentioned outer circle and that a portion of the segment between 
the square and the circle is involved. But here there is confusion in interpreting the 
second half of the sütra. This part has a clause lumpetpuriseneh which translates 
as "cover it with loose earth", meaning obliterate it (a part of the segment as 
above), but there is ambiguity about what part is to be obliterated; it could refer to 
obliterating $\frac{1}{5}$th part keeping the rest, or the other way around. In [11] the former 
interpretation is adopted, which may be appropriate linguistically, but is not what 
is actually intended. From the whole context it is clear that the remaining $\frac{4}{5}$th part 
of the segment was meant to be obliterated; it would be too much of a coincidence 
otherwise for the value to come out as close as seen above. It may be borne in mind 
that at the time of the Sulvasütras these shortcomings in the presentation would not 
have mattered, as the correct meaning would have been handed down orally. The 
text was intended only to serve as an aide-mémoire. In particular the unmentioned 
outer circle would have been presumed on account of the analogous one in the earlier 
construction. 


4. Squaring the Circle 


The problem of "squaring the circle", viz. finding a square with the area of a 
given circle, is also considered in the Sulvasütras. Baudhayana Sulvasütra, which 
has the best treatment of the issue from among the Sulvasütras, deals with it, 
not in terms of a geometric construction as with the converse, but by relating 
numerically the side of the desired square to the diameter of the given circle; 
it has been argued in [9] that sutra 3.2.10 from Manava is in fact a geometric 
construction for squaring the circle, which we shall discuss later in the section. 
Baudhayana gives two formulae in this respect, the second being qualified as an 
approximate one. The first of them we shall discuss in some detail below. The 
second consists of taking $\frac{13}{15}$th of the diameter of the given circle for the side of the 
desired square. Curiously, only this more approximate formula has been described 
in Apastamba and Katyayana Sulvasütras for the purpose. This formula is very 
crude: for the circle of unit radius it gives the square with side $\frac{26}{15}$ whose area would 
be $(\frac{26}{15})^2 = 3.0044444\ldots$, in place of the correct value 3.1415927..., smaller by 
more than 4%. Even in terms of fractions with small denominators there are better 
approximations possible. The Egyptians took $\frac{8}{9}$th of the diameter as the side of the 
corresponding square (see [8], Chapter 13). This gives for the unit circle the square 
with area 3.1604938..., which is more than the actual value by just about 0.6%. 
Another simple fraction, $\frac{7}{8}$, as the ratio of the side of the square to the diameter 
also yields a better result: for the unit circle the area of the square would be 3.0625, 
which is about $2\frac{1}{2}$% less than the actual value. Incidentally, $\frac{7}{8}$ would have been a 
natural choice for Baudhayana, in giving an approximation, since as we shall see 
below it is a component of the refined value given by Baudhayana. It would seem 
that $\frac{13}{15}$ was the traditional value (going farther back in time) and even after the 
refined formula was discovered the traditional value continued to be quoted without 
modification. 
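
The three ratios can be compared side by side. The Python sketch below is a modern check for the circle of unit radius; the helper name `square_area` is hypothetical.

```python
from fractions import Fraction
import math

def square_area(ratio: Fraction) -> Fraction:
    """Area of the square whose side is `ratio` times the diameter, for unit radius."""
    return (2 * ratio) ** 2

for ratio in (Fraction(13, 15), Fraction(8, 9), Fraction(7, 8)):
    area = square_area(ratio)
    relative_error = float(area) / math.pi - 1
    print(ratio, float(area), f"{relative_error:+.2%}")
```

The printed relative errors are about -4.37%, +0.60% and -2.52% respectively.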

In the construction of the altars there does not seem to be any occasion to convert a 
circle into a square. This would suggest that the relation as above was recorded in the 
Sulvasütras (all four) only to convey a general sense of magnitude (and perhaps in 
deference to tradition). On the other hand the other value described in Baudhayana, 
that was referred to above and we will now discuss, is apparently determined with 
serious effort. 

The refined formula for the ratio of the side of the square to the diameter of the 
circle with the same area is described as follows: divide the diameter into 8 parts, 
divide one of them into 29 parts and remove 28 of the parts and from the remaining 
one remove a sixth part and include an eighth of it; thus it is 

$$\frac{7}{8} + \frac{1}{8 \times 29} - \frac{1}{8 \times 29 \times 6} + \frac{1}{8 \times 29 \times 6 \times 8}.$$

The fraction is $\frac{9785}{11136}$ and for the circle with unit radius yields the area 3.0883265..., 
a little more than 98.3% of the actual value. 
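
The sum of the four terms and the resulting area are easy to confirm with exact rational arithmetic; the following Python lines are a modern check only.

```python
from fractions import Fraction
import math

ratio = (Fraction(7, 8) + Fraction(1, 8 * 29)
         - Fraction(1, 8 * 29 * 6) + Fraction(1, 8 * 29 * 6 * 8))
area = float((2 * ratio) ** 2)   # square on ratio * diameter, unit radius

print(ratio)            # 9785/11136
print(area)             # about 3.0883265
print(area / math.pi)   # about 0.983
```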

A discerning reader may notice that Baudhayana’s construction of the circle with 
the area of a given square produced a circle whose area was bigger by about 1.7% 
while the above prescription in the opposite direction produces a square which is 
smaller in size by about 1.7%. In other words if one started with a square, followed the 
construction as in the last section to get a circle and then used the above formula to get 
back the square, it would be very close to the original. A more precise computation 
(in place of what is pointed out as something to be noticed at first glance) shows 
that if one starts with the unit square, transforms it into a circle and then transforms 
that into a square as indicated, one gets a square with side 1.0000024... (!). This 
is no coincidence. The desired fraction as above was evidently computed as the 
inverse of the ratio involved the other way, namely in circling the square, or else we 
could not have got such a close answer. If the computation of the inverse could have 
been exact the above number would have come out to be exactly 1. However in the 
computation of the inverse certain approximations had to be made, leading to the 
small difference. 
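
The round trip can be checked numerically. In the Python sketch below, which is a modern verification and not anything found in the texts, the unit square is circled by the Baudhayana construction and the resulting diameter is multiplied by the refined fraction 9785/11136.

```python
from fractions import Fraction
import math

diameter = (2 + math.sqrt(2)) / 3                # circle obtained from the unit square
side = diameter * float(Fraction(9785, 11136))   # back to a square by the refined formula

print(side)       # about 1.0000024
print(side ** 2)  # about 1.0000048
```

The side of the recovered square exceeds 1 by about 2.4 parts in a million, and its area by about twice that.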

It may be recalled that for the unit square the diameter of the corresponding 
circle obtained by the Baudhayana method would be $(2 + \sqrt{2})/3$. For $\sqrt{2}$ 
Baudhayana has an (approximate) formula, which we shall discuss in detail in 
the next section, associating with it the value 
$1 + \frac{1}{3} + \frac{1}{3 \times 4} - \frac{1}{3 \times 4 \times 34} = \frac{577}{408}$, and the ratio 
$(2 + \sqrt{2})/3$ to be inverted would thus be 
$1 + \frac{1}{9} + \frac{1}{9 \times 4} - \frac{1}{9 \times 4 \times 34}$ (expressed in 
the style of the Sulvasütras). By our current method we would write the number 
as $\frac{1393}{1224}$ and produce $\frac{1224}{1393}$ as the inverse. However, this method may not have been 
known at that time. Thus they either did not identify the number as $\frac{1393}{1224}$ or did 
not consider the reciprocal $\frac{1224}{1393}$ for the inverse formula, for one or other reason, 
including possibly that it cannot be expressed in terms of making a smaller number 
of parts as in the above expressions, as $1393 = 7 \times 199$ has the large prime factor 199. 
Instead their computation 
seems to have gone along the following lines. A part $\frac{7}{8}$ was first separated as the 
significant part; this could be either following a way that they may have had to find 
the inverse of a fraction expressed as a sum of the terms as above, or could be an ad 
hoc choice noticing that the inverse should be greater than that. The remaining part 
after separating out $\frac{7}{8}$ turns out, in our notation, to be $\frac{41}{1393 \times 8}$. They however opted 
to replace it by $\frac{41}{1392 \times 8}$, which could be split up as 
$\frac{1}{8 \times 29} - \frac{1}{8 \times 29 \times 6} + \frac{1}{8 \times 29 \times 6 \times 8}$, thus 
arriving at the quadrature formula. 
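
The chain of fractions in this reconstruction can be followed step by step with exact arithmetic. The Python sketch below only verifies the identities in modern notation; it makes no claim about how the computation was actually carried out.

```python
from fractions import Fraction

sqrt2 = 1 + Fraction(1, 3) + Fraction(1, 3 * 4) - Fraction(1, 3 * 4 * 34)
ratio = (2 + sqrt2) / 3              # ratio used in circling the square
inverse = 1 / ratio                  # exact reciprocal
remainder = inverse - Fraction(7, 8) # what is left after separating out 7/8

adjusted = Fraction(41, 1392 * 8)    # denominator 1393 replaced by 1392
split = (Fraction(1, 8 * 29) - Fraction(1, 8 * 29 * 6)
         + Fraction(1, 8 * 29 * 6 * 8))

print(sqrt2, ratio, inverse)         # 577/408 1393/1224 1224/1393
print(remainder)                     # 41/11144, i.e. 41/(1393*8)
print(adjusted == split)             # True
print(Fraction(7, 8) + adjusted)     # 9785/11136
```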


It does not seem to have been noted in the earlier literature, but it turns out that 
taking only the first two terms in Baudhayana's expression yields a better result. 
The fraction would then be $\frac{51}{58}$, which for the circle with unit radius yields the area 
3.0927467..., closer to the actual value $\pi = 3.1415927\ldots$ compared to the value 
3.0883265... obtained above. Thus the additional two terms in fact worsen the 
result in terms of the correct value for the area of the circle. Of course this could 
not have been anticipated by Baudhayana. Interestingly, however, if the alteration 
of 1393 in the denominator to 1392 were done before separating out the part $\frac{7}{8}$ then 
they would have ‘accidentally’ got the inverse to be $\frac{51}{58}$ as above. This however is 
not as close to the inverse of $(2 + \sqrt{2})/3$ as Baudhayana’s value was seen to be, 
which would have been the objective at that time. 
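
Both comparisons made here, with $\pi$ on the one hand and with the inverse of $(2 + \sqrt{2})/3$ on the other, can be checked as follows. This is a Python sketch in modern terms; the helper `area` is hypothetical.

```python
from fractions import Fraction
import math

two_terms = Fraction(7, 8) + Fraction(1, 8 * 29)   # 51/58
four_terms = Fraction(9785, 11136)                 # the full expression
exact_inverse = 3 / (2 + math.sqrt(2))             # inverse of (2 + sqrt(2))/3

def area(ratio: Fraction) -> float:
    """Area of the square on ratio * diameter, for the circle of unit radius."""
    return float((2 * ratio) ** 2)

print(two_terms, area(two_terms), area(four_terms))
print(abs(float(two_terms) - exact_inverse), abs(float(four_terms) - exact_inverse))
```

The first line shows that 51/58 gives the area closer to $\pi$; the second shows that the four-term value is much closer to the inverse that was being sought.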

The choice of $\frac{7}{8}$ as the part to be separated is interesting, especially if it was ad 
hoc. Why was it not chosen instead to be $\frac{13}{15}$, the supposed approximate value? Was 
it known that it was better than the latter, even though it had not been stated? Why 
did it then continue to be unstated even after the refined formula was exposited? 
Or were they from two different substreams of thought, even though they occur 
alongside each other in Baudhayana? 

Let me now come to the sutra from Manava Sulvasütra, sūtra 3.2.10, to which 
a reference was made in the beginning of the section. Some developments around 
it are rather puzzling. Though the sutra is rather condensed, an overall reading 
readily indicates that it has something to say on the question of squaring the circle, 
as also the converse. However various commentators, including van Gelder who 
produced the first edited version and English translation, have interpreted it to 
relate to only the converse problem of circling the square, in a way equivalent to 
that in Baudhàyana. A new interpretation was given by Hayashi in [9], after some 
emendation of the original text which has been justified on grounds of rectification 
of possible corruption of the manuscript over a period. According to [9] both the 
problems of circling the square and squaring the circle are addressed in the sutra. 
The square with the area of a given circle is described to be of side equal to the height 
of the equilateral triangle over the diameter of the circle; the converse statement 
continues to be interpreted to be essentially as in Baudhayana, but there is also a 
slight variation offered (attributed to the referee of the paper). It may be observed 
that for the circle with unit radius the side-length of the stipulated square would be 
$\sqrt{3}$, which yields the area of the square to be 3, in place of $\pi = 3.1415927\ldots$, less 
by about 4.5%. This is a rather crude value, much cruder than even the approximate 
value given in Baudhayana and stated also in Apastamba and Katyayana. In the 
context of the improved result from Manava Sulvasütra that we saw in the last 
section, both direct and converse statements as above seem surprising; the first part 
on account of the crudity of the value and the second being not as sharp as the 
improved result that we noted. However such anomalies seem to be inherent in 
Sulvasütras, as we have seen elsewhere also, since they continued to exposit the 
traditional values of their time with no critical evaluation, even after better results 
became known. It may be noted in this context that the sūtrakāra characterises the 
contents of the above sutra, by a mention in the preceding sutra, as “prescriptions of 
informed persons”, traditional knowledge in other words. The refined method for 
circling the square described in the preceding section comes later (though it is not 
clear if much can be read from the sequencing) and is presumably either due to the 
author himself or more contemporary knowledge of his time. 

While we are discussing the area of the circle the reader may wonder what the 
Sulvasütras had to say about the circumference of the circle. In Baudhayana there 
is only an incidental reference to it, where a circular pit “with diameter 1 pada and 
circumference 3 padas” is mentioned, indicating that the circumference was taken 
to be 3 times the diameter; interestingly this coincides with the value according to 
the Bible. In Manava Sulvasütra however the circumference is stated to be $3 + \frac{1}{5}$ times 
the diameter (sutra 10.3.2.13 as in [11]; the interpretation of the sutra in [11] as 
describing the side of the square is incorrect); the statement of the ratio is followed 
by "not a hair-breadth remains"! Evidently a lacuna in the classical value had been 
overcome and it was worth exulting over it! 


5. The Square Root of 2 


Three of the four Sulvasütras, Baudhayana, Apastamba and Katyayana, describe 
the following formula for $\sqrt{2}$ (in words), expressing the value as 

$$1 + \frac{1}{3} + \frac{1}{3 \times 4} - \frac{1}{3 \times 4 \times 34}.$$

The sutra is followed by savisesah in Baudhayana, sa visesah in Apastamba and sa 
visesa iti visesah in Katyayana. The word visesah means "extra" and the wording in 
Katyayana Sulvasütra conveys that the number is in excess of the desired quantity 
(cf. [12]). The term visesah was also used as a technical term for the excess of the 
diagonal over the original measure, and savisesah thus signified “together with the 
extra”, namely the diagonal itself. As such the term itself is not connected with the 
value being considered to be in excess, or approximate. 

Though we would not know how they obtained it, such a complex expression 
would have been arrived at by some process (and not directly by intuition or 
guesswork). Whatever the process, along the way it would have been noticed that 
the targeted magnitude has not been reached exactly. Thus it would have been 
known, at least at the time when the value first came to be assigned, that it was not 
exact. (It is also tempting to argue that they could have squared the number and 
found that it is not 2, but it is doubtful if they would have taken up such an arithmetic 
exercise). 

In decimal expansion the value of the expression is 1.4142157..., which is 
remarkably close to the actual value 1.4142136.... A 
few points should be noted in this respect however. Firstly, it may be recalled 
that the Babylonian cuneiform tablet YBC 7289 (Yale Babylonian Collection), 
dating back to around 1800 to 1600 BCE, also contains an approximate value 
for $\sqrt{2}$, in the sexagesimal system as 1;24;51;10, which yields the value 
$1 + \frac{24}{60} + \frac{51}{60^2} + \frac{10}{60^3} = 1.4142129\ldots$. Furthermore the Babylonian value, described in a 
pre-existing standardised system with uniform subdivisions, namely the sexagesimal 
system of representation, carries an intrinsic sense of the level of accuracy. The 
Sulvasütras value however is evidently the output of a specific process, and there 
is no intrinsic gauge of accuracy involved. No details of how the procedure went 
are found in the Sulvasütras but some likely methods have been proposed, that we 
shall discuss below. In the context of these it would seem that while refining on the 
approximation $1 + \frac{1}{3} + \frac{1}{3 \times 4} = \frac{17}{12} = 1.4166667\ldots$, the next step took them directly 
to the above fine value, due to certain special features of the number $\sqrt{2}$; it could 
thus have been fortuitous to have got so close to the actual value. 
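
The two historical values can be set against $\sqrt{2}$ with a few lines of Python. This is a modern comparison only; the variable names are chosen for this illustration.

```python
from fractions import Fraction
import math

sulva = 1 + Fraction(1, 3) + Fraction(1, 3 * 4) - Fraction(1, 3 * 4 * 34)
babylonian = 1 + Fraction(24, 60) + Fraction(51, 60 ** 2) + Fraction(10, 60 ** 3)

print(sulva, float(sulva))            # 577/408, about 1.4142157
print(babylonian, float(babylonian))  # 30547/21600, about 1.4142130
print(float(sulva) - math.sqrt(2))       # about +2.1e-06
print(float(babylonian) - math.sqrt(2))  # about -6.0e-07
```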

Actually there is also no indication that they had serious interest in accuracy 
of that level. No other nonexact relation found in the Sulvasütras is that close. It 
may be recalled that even when better values were available in the case of the 
quadrature formula, for instance, only a rough approximation was recorded in 
the later Sulvasütras. For $\sqrt{2}$, Manava, long after Baudhayana, proffers only the 
approximate value $\frac{7}{5}$. Thus while they were deeply preoccupied with geometric 
aesthetics, high numerical accuracy is hardly seen to be considered seriously as an 
issue to be pursued. 

There have been various suggestions on how the formula may have been arrived 
at. One convincing possibility put forth by B. Datta [4] (see also [19]) goes as 
follows. The presentation here is essentially along the lines of [19] with some 
modifications for clarity. We have two unit squares to be put together and formed 
into a square. Divide the second square into three equal parts by lines parallel to 
one of the sides. Two of these you attach at the sides of the first square sideways; 
this produces a gnomon (see Figure 10). Divide the third part into a square of side 
$\frac{1}{3}$ and a rectangle of sides $\frac{2}{3}$ and $\frac{1}{3}$. The square part is used to convert the gnomon 
into a square of side $1 + \frac{1}{3}$. The rectangle is divided into four equal parts along the 
length, into rectangles with sides $\frac{2}{3}$ and $\frac{1}{3 \times 4}$. Lengthwise two of these strips can 
be placed along the sides of the square formed earlier, side by side. We thus get a 
square except for a missing little square at the corner, of side $\frac{1}{3 \times 4}$. 

Now what we have is a square which is a difference of a square with side 
$1 + \frac{1}{3} + \frac{1}{3 \times 4} = \frac{17}{12}$ and a small one of side $\frac{1}{12}$. If strips of width $\frac{1}{3 \times 4 \times 34}$ are cut out 
from the two sides, that would adjust for the square to be removed, except for a tiny 
square with side $\frac{1}{3 \times 4 \times 34}$ formed of the overlap of the two strips. The latter is ignored, 
and the square formed by cutting out the strips is produced as the answer. 
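
The area bookkeeping in this route can be verified with exact fractions. The Python sketch below follows the steps just described in modern notation; it is an illustration of the arithmetic, not a reconstruction of any procedure attested in the texts.

```python
from fractions import Fraction

side = 1 + Fraction(1, 3) + Fraction(1, 3 * 4)   # 17/12, after rearranging the pieces
missing = Fraction(1, 3 * 4) ** 2                # little corner square still absent
assert side ** 2 - missing == 2                  # the pieces have total area 2

# Strips of width w along two sides should remove (nearly) the missing area:
# 2 * side * w = missing, ignoring the w-by-w overlap of the two strips.
w = missing / (2 * side)
final_side = side - w
excess = final_side ** 2 - 2                     # area of the ignored overlap

print(w)           # 1/408, i.e. 1/(3*4*34)
print(final_side)  # 577/408
print(excess)      # 1/166464, i.e. w squared
```

The ignored overlap is exactly the amount by which the square on 577/408 exceeds 2.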
Figure 10: Possible route to the formula for $\sqrt{2}$. 


It has been argued in [7], convincingly, that the Babylonian value of $\sqrt{2}$ (noted 
above) would have been obtained by a procedure involving the following idea. 
Suppose we have a square with a given sidelength and we would like to get a 
square with an area larger by a certain amount which is relatively small compared 
to the area of the original square. The difference between the desired new square 
and the original one is seen to consist of two (long) rectangles and a small square 
(the reader may see Figure 6 for reference); the long side of the rectangles has the 
original sidelength and the other side corresponds to the desired increment. As the 
remaining square is small, the desired increment in the area divided by twice the 
original sidelength would be a good approximation for the requisite increment in 
the length; the area to be added is thus put in the two rectangles themselves. In 
modern notation, if $l$ is the original sidelength and $a$ is the amount of area to be
added, then $l + \frac{a}{2l}$ is an approximation to $\sqrt{l^2 + a}$; this may be compared with
the binomial expansion. Similarly, if we have a square of a given sidelength and
wish to get a square whose area is smaller by a certain amount, then the latter
divided by twice the sidelength of the original square gives an approximation to the
requisite reduction in length; this corresponds to $l - \frac{a}{2l}$ as an approximation to
$\sqrt{l^2 - a}$.

It is illustrated in [7] in detail how the Babylonians may have arrived at their
value of $\sqrt{2}$ using the idea, in two steps. Starting with a square with sidelength $\frac{3}{2}$ and
seeking to get a square with area 2, the side is reduced by $\left(\left(\frac{3}{2}\right)^2 - 2\right) / \left(2 \times \frac{3}{2}\right) = \frac{1}{12}$ to
$\frac{17}{12}$ (in [7] the details are explained in the Babylonian sexagesimal notation, but we
shall not go into it here). This is still a larger value and a second reduction is effected
to get a second approximation to $\sqrt{2}$. The fraction that one is after for reduction in
the second step would be (in our notation) $\left(\left(\frac{17}{12}\right)^2 - 2\right) \times \frac{6}{17} = \frac{1}{3 \cdot 4 \cdot 34}$, the same as the
last term in the Śulvasūtras expression. The Babylonians did not deal with fractions
directly however, and the desired numbers were found in sexagesimal points (as
our calculators would do in decimal points) and that led them to the approximation
1; 24; 51; 10 for $\sqrt{2}$, in sexagesimal notation, as noted above. The possible routes to
this have been discussed in [7]. Interestingly (and rather fortuitously!) 1; 24; 51; 10
is closer to $\sqrt{2}$ than $\frac{577}{408} = \frac{17}{12} - \frac{1}{3 \cdot 4 \cdot 34}$, which was the approximation sought to be computed
(according to [7]).
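
The two reduction steps and the closing comparison can be reproduced numerically. The sketch below is an illustration in modern notation only; the function name `reduce_side` is hypothetical, and the sexagesimal value is read, as an assumption, as $1 + 24/60 + 51/60^2 + 10/60^3$.

```python
from fractions import Fraction as F
import math

def reduce_side(side, target_area):
    """Shrink the side by (side^2 - target_area) / (2 * side), as in the rule above."""
    return side - (side ** 2 - target_area) / (2 * side)

first = reduce_side(F(3, 2), 2)      # 3/2 reduced by 1/12
second = reduce_side(first, 2)       # 17/12 reduced by 1/408

# Assumed reading of the sexagesimal digits 1; 24; 51; 10.
babylonian = 1 + F(24, 60) + F(51, 60 ** 2) + F(10, 60 ** 3)

root2 = math.sqrt(2)
print(first, second)                             # 17/12 577/408
print(abs(float(babylonian) - root2))            # error of the sexagesimal value
print(abs(float(second) - root2))                # error of 577/408, which is larger
```

The sexagesimal value falls slightly below $\sqrt{2}$ and the fraction slightly above it, which is why truncating the sexagesimal expansion happens to land closer.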

A procedure as above is well within the scope of the thought process involved
in the Śulvasūtras, and it could well have been known; it is also possible that the
idea may have come down from the Babylonians, but there has been no supporting
evidence to that effect. In the case of the Śulvasūtras the approximation as above may
have been obtained from the above procedure by starting with a square of sidelength
$1 + \frac{1}{3}$. This is a smaller square, and the length to be added is $\left(2 - \left(\frac{4}{3}\right)^2\right) \times \frac{3}{8} = \frac{1}{3 \cdot 4}$,
which yields the first approximation $\frac{17}{12}$. In the second step this would lead to the
quoted value, as seen above. This seems to fit better with the way the number is
expressed.

It has been pointed out in [7] that the approximation as above is mathematically
equivalent (emphasis as in original) to Heron's method for extraction of square
roots, which involves the following: Let $A = l^2 + a$. If $l$ is a (first) approximation
to $\sqrt{A}$, then $A/l$ also approximates $\sqrt{A}$, and moreover if $l < \sqrt{A}$ then $A/l > \sqrt{A}$
and vice versa. Therefore $\frac{1}{2}\left(l + \frac{A}{l}\right)$ is a candidate for a better approximation. But
$\frac{1}{2}\left(l + \frac{A}{l}\right) = l + \frac{1}{2}\left(\frac{A}{l} - l\right) = l + \frac{a}{2l}$, the approximation seen earlier. It may therefore
seem tempting to argue that the approximate expression for $\sqrt{2}$ was obtained via
such averages. However, arriving at it in this way involves a kind of familiarity
with algebra that is not evidenced in the Śulvasūtras; [7] does not seem to suggest
it in respect of the Babylonians either.

Some authors have also brought in the more elaborate expression
$l + \frac{a}{2l} - \frac{(a/2l)^2}{2\left(l + \frac{a}{2l}\right)}$ for $\sqrt{l^2 + a}$ ([15] or [19], and [2]) as explanation for the
approximations for $\sqrt{2}$ in the Śulvasūtras. Some of the other suggestions, including
one by Thibaut, who produced the first English edition of Baudhayana, supplementing
it with his commentary, also involve taking difference of squares (see
[20] or [19]), relying on some arithmetic. These explanations however do not
seem natural for the context, and seem to depend on ideas that evolved only in a
later period, unlike the geometrical explanations above which are consistent with
the Śulvasūtras style.
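
The algebraic identities in the last two paragraphs can be verified with exact fractions. This is a sketch in modern notation, with hypothetical function names and the starting value $l = 1 + \frac{1}{3}$ chosen purely as an illustration; it makes no claim about how the value was historically obtained.

```python
from fractions import Fraction as F

def increment_rule(l, a):
    """Approximate sqrt(l^2 + a) by l + a / (2 l)."""
    return l + a / (2 * l)

def heron_step(l, A):
    """Heron's average of the guess l and A / l."""
    return (l + A / l) / 2

A = F(2)
l = F(4, 3)
a = A - l ** 2                       # area still to be added, here 2/9

x1 = heron_step(l, A)
assert x1 == increment_rule(l, a)    # one Heron step equals the increment rule

x2 = heron_step(x1, A)
elaborate = l + a / (2 * l) - (a / (2 * l)) ** 2 / (2 * (l + a / (2 * l)))
assert x2 == elaborate               # two Heron steps equal the elaborate expression

print(x1, x2)                        # 17/12 577/408
```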

Here is another possibility which I would like to propose, taking the context of
the Śulvasūtras into account. Having found the need for a numerical value for the
diagonal of the square (I will argue below that this was the case), one may simply
choose to find it by measurement with the ropes. This seems natural also from the
point of view that they were using ropes for a variety of constructions. Take a rope
of length equal to the diagonal of a square. Mark out one unit from one end. By
measuring the remaining piece against the unit piece one notes that the former
is more than one-third of the unit and less than half. We take away one-third of the
unit from it and consider the remaining part. It is now easier to compare the remaining
piece with the second piece, of length $\frac{1}{3}$, rather than the unit. The remaining piece
turns out to be slightly less than one-fourth of the second piece. Thus the diagonal is
slightly less than $1 + \frac{1}{3} + \frac{1}{3 \cdot 4}$.
How much less? You measure the third piece with the length that fell short. It is very
close to 34 times. Thus you decide to take away one thirty-fourth of the last piece, which gives
the formula. One may start with a rope of length 35 feet for the unit, so the required
total rope length, namely the length of the diagonal, is about 50 feet. In this case the
length of the final remaining piece as above will be over an inch and the penultimate
piece to be measured with it would be about 3 feet, comfortable sizes for manual
measurements, establishing that it is almost exactly 34 times. Note that use of over
50 feet long ropes was quite common in the Śulvasūtras constructions; the rope
involved in the ekarajjuvidhi for the construction of the Mahāvedi measured over
100 feet. There are some variations possible over the above theme, and though one
may ask why this one was picked, the answer can simply be that through trials this
would have been found most convenient.
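
The measurements proposed above can be simulated numerically. The following sketch replaces the rope by floating-point arithmetic and uses illustrative variable names; it is meant only to confirm the sizes quoted, not to model the physical procedure.

```python
import math

unit = 1.0
diagonal = math.sqrt(2) * unit       # rope as long as the diagonal of the unit square

rest = diagonal - unit               # piece left after marking off one unit
assert unit / 3 < rest < unit / 2    # more than a third of the unit, less than half

second = unit / 3                    # second piece: a third of the unit
rest -= second
third = second / 4                   # third piece: a quarter of the second piece
assert rest < third                  # the remainder falls slightly short of it

shortfall = third - rest
times = third / shortfall            # how often the shortfall fits into the third piece
approx = unit + second + third - third / 34

# Sizes for the 35-foot unit suggested in the text.
unit_feet = 35
third_feet = unit_feet * third                    # penultimate piece, in feet
last_inches = unit_feet * (third / 34) * 12       # final piece, in inches

print(round(times, 2))               # close to 34
print(approx, diagonal)
print(round(third_feet, 2), round(last_inches, 2))
```

With a 35-foot unit the penultimate piece comes out a little under 3 feet and the final piece a little over an inch, in line with the sizes stated above.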

What was the motivation of the sūtrakāras for determining the value of $\sqrt{2}$?
We have seen before that $\sqrt{2}$ is involved in the Śulvasūtras as the diagonal of
the unit square. It is called dvikarani. As a magnitude it was involved in doubling
the area of the figures to be drawn and, as mentioned earlier, this was achieved by
enhancing the reference unit to the size of the diagonal of the corresponding unit
square, and following the same steps of construction as before, for the figure to be
drawn. For this purpose however one does not actually need the numerical value
of $\sqrt{2}$. Nor would the complex formula be of much help in producing a mark for
dvikarani, as it would involve the cumbersome process of subdividing the segments
into as many as 34 parts.⁷ It would be much simpler to draw a square and mark its
diagonal on the rope. The numerical value was needed on the other hand in deriving
the quadrature formula, in order to find the inverse of $(2 + \sqrt{2})/3$, as seen in the
last section. As noted by Seidenberg [17], almost certainly this problem served as the
motivation for finding a numerical value for $\sqrt{2}$. Of course, as the reader may recall
from the earlier sections, the later Śulvasūtras, Apastamba and Katyayana, included
the value of $\sqrt{2}$ but not the refined quadrature formula as in Baudhayana for which
the value of $\sqrt{2}$ was used. This could however be simply due to the fact that once
the value was found it may have seemed appealing to the later authors.

⁷ In [10] it is stated (on page 361), referring to the papers of Albert Bürk on the Apastamba Śulvasūtra, that
the approximate formula for $\sqrt{2}$ was used for the construction of a square. The square that is referred
to is in fact the one described towards the end of Section 1, constructed by the Nyanchana method with
the triple $(1, 1, \sqrt{2})$. For the length $\sqrt{2}$ the original sūtra asks to use the savisesah. This apparently has
been interpreted by these authors as using the approximate value for it rather than the diagonal itself. However
savisesah stood for diagonal as well, and there is no reason to suppose that in the above construction they
followed the cumbersome procedure involving the formula for $\sqrt{2}$ to fix the length, rather than using the
diagonal itself, in the manner I indicated.

There has been unwarranted speculation in the literature, imputing knowledge 
of the irrationality of $\sqrt{2}$ to the sūtrakāras (see [4], the discussion on p. 195).
They would have been aware that the value they gave was an approximate one,
and that is the closest it gets. To infer from this the knowledge of irrationality
of $\sqrt{2}$ is to miss or to trivialise the significance of the notion of irrationality.
Irrationality is not just about observing that attempts to express a number as a
fraction yield only approximate results (even this cannot be said to be subsumed
in its entirety in the Śulvasūtras as there is but one approximate value described),
but to be able to assert, with valid argument, that it is impossible to express it
as a fraction. The Śulvasūtras were concerned with practical aspects of geometry
of altar building, and some of it they abstracted, developing a theoretical
understanding. One does not find them getting involved with philosophical or arith- 
metical aspects of numbers, and there is no point in indulging in wishful thinking 
about it. 
On the Origins of Zero 


Frits Staal 


1. Indic Origins? 


IT IS WIDELY believed that zero originated in Indic Civilization but the evidence 
in support of that belief is not only meager; it is almost zero. No place or time, let 
alone the name of a discoverer or inventor, has ever been suggested. How can we 
handle such a problem? We must start from the beginning. 

Indic Civilization starts with the Indus Civilization which is earlier than the 
Vedas. Its inscriptions exhibit occurrences and sequences of circles that resemble 
the numerals that have expressed zeroes in more recent times; to be a little more 
precise: more than three thousand years later. Other civilizations roughly contem- 
porary with the urban complexes of Mohenjo-daro and Harappa used circles also, 
but they did not refer to zeroes. On cuneiform tablets from Uruk in Mesopotamia, 
dating to 3,000 BCE, circles refer to the number “2” (Tropfke 1980: 29). It weak- 
ens the suggestion that the Indus circles were expressions of zero. A more serious 
difficulty is that the language of the Indus inscriptions is not known. It is not even 
clear that it was a language and its uses are controversial. Some of these topics are 
discussed in Staal 2008: 7-11, which provides the evidence and further references. 

With Vedic mathematics we are on firmer ground. We find not only geometry 
but integers, a rudimentary decimal system for counting. It did not include nega- 
tive numbers, but “addition, subtraction and perhaps multiplication of whole num- 
bers” (Hayashi 2003: 360-61). The Rigveda made use of recursion (Divakaran, 
forthcoming). It did, moreover, distinguish between cardinal and ordinal numbers
(Renou 1964: 92; Staal 2007: 589-590; 2008: 272-273). In all these cases we are
dealing with numbers, not with numerals: the Vedas are an Oral Tradition since
there was no writing on the subcontinent prior to the Buddhist Emperor Aśoka who
reigned from 268 to 231 BCE. But “zero” did not only lack a symbol. There was
no term for it in the oral tradition. The word kha, which Indian mathematicians 
used later to denote zero, occurs in Vedic only in the senses of “hole”, “opening”, 
“vacancy” or "space". 

Counting boards based upon the decimal system took another step but the Indic 
evidence is of later date and the empty spaces are zeroes of a kind, not symbolic 
expressions. Even today, the Indian pandita or traditional scholar uses neither 
counting boards nor books: he carries his knowledge in his head. Writing had 
its commercial uses, manuscripts go back to the beginnings of the Common Era, 
but it began to be used widely only after the invention of Hinduism in the early 
nineteenth century (Staal 2008a). 

Fortunately, it is not the end of our story since those who look for the origins of 
something, even if it is zero, must look beyond the domain where it is customarily 
located, even if it is absent. The remaining parts of my essay attempt to do so. Part 
2 will discuss two pioneering investigations. Parts 3 and 4 will take us beyond the 
history of mathematics. Part 3 will pay attention to linguistics and Part 4 to the 
Vedic theory of ritual, not included in modern classifications or curricula though 
regarded during the period of middle Vedic as an exact science. Having gone that 
far we must recall that “exact” and “science” are often no more than labels and
that all names of disciplines are due to us, not to the universe to which we belong. 


2. Khmer and the Buddhist Madhyamaka 


Two original contributions, undigested legacies from the twentieth century, have 
to be taken into account if we wish to understand how zero may or may not have 
been discovered. The first is due to Joseph Needham, the famous scholar and 
scientist who published, together with his collaborators, the many volumes of Sci- 
ence and Civilization in China. The second is due to David Ruegg, a brilliant 
Sanskrit scholar whose early work dealt with the Sanskrit grammarians but whose 
chief contributions since have been to Buddhist Studies, primarily as expressed in 
Sanskrit and Tibetan sources. 

Needham presented his ideas in the third volume of his series. It is entitled 
“Mathematics and the Sciences of the Heavens and the Earth” and was first pub- 
lished in 1959. It is important to understand what Needham tried to do for he 
has been criticized and misunderstood like other pioneers. He did not examine 
Chinese sciences from a simple “evolutionist” concept of history, as if they were 
“more or less clumsy attempts to express modern scientific ideas,” a notion that
Pingree wisely rejects in another context (Pingree 2003: 45). But neither did 
Needham present Chinese sciences from “the Chinese point of view,” whatever 
that is, as Seyyed Hossein Nasr (1968: 21) tried to do with respect to the Islamic 
Sciences. Needham’s perspective is different and he has expressed it in unambiguous
terms: “to write the history of science we have to take modern science as our
yardstick—that is the only thing we can do—but modern science will change and 
the end is not yet" (Needham 1976: xxxi with further discussion in Staal 2006: 
91-97). 

D. J. de Solla Price, a historian of science at MIT, described Needham’s work 
as follows: “In my estimation, the essential contribution made thus far by the 
six volumes of Science and Civilization in China lies in the systematization and 
presentation in English translation or summary of the substantive content of the 
otherwise ill-digested bulk of Chinese scientific and technical literature. Here we 
have the raw material on which generations of later scholarship can be founded. 
Here at last we have some map to tell us where to look, and some indication at 
least of what we shall find" (1971: 17-18). 

Needham's volumes deal with much more than the Chinese sciences. They 
abound in references to Indian, Mesopotamian and European disciplines. The dis- 
cussion on the origin of zero begins to meander in Volume 3 on page 9. Needham 
knows, of course, that zero is widely believed to have originated in Indic Civiliza- 
tion, he is familiar not only with the Chinese but also with Indic counting boards 
and then takes an unexpected turn: he zeroes in on early South East Asian inscrip- 
tions. The thesis that emerges will not surprise us but is not formulated in a few 
simple sentences or in a single paragraph. We get the idea when we combine three 
separate sentences that occur on pages 10-12 and that I quote here because they 
are Needham's own words: 


• “The usual view is that the circular symbol for zero derived directly from
India, where it first appears on the Bhojadeva inscriptions at Gwalior
dated +870.”

• “While the first epigraphic evidence for the zero in India is, as has just been
mentioned, of the late +9th century, it has been discovered about two centuries
earlier in Indo-China and other parts of south-east Asia. This fact may
be of much significance.”

• “It would seem, indeed, that the finding of the first appearance of the zero in
dated inscriptions on the borderline of the Indian and Chinese culture-areas
can hardly be a coincidence.”


I shall not discuss the first sentence though more recent discoveries of Indian 
inscriptions have pushed the dates further back. It neither affects Needham’s the- 
sis, nor the origins of zero as we shall see. The second sentence is based upon 
an early article published in 1931 by another celebrated scholar, George Coedès,
the French historian of South East Asia and especially of the Khmer Empire of
what are now Kampuchea and parts of Thailand. He did not only write the classic
account of the ‘Indianization’ of Indochina and Indonesia that was first published
in French, often reprinted since 1964 and translated into English as The Indianized
States of Southeast Asia. He also published eight volumes of Cambodian inscriptions,
and wrote much else that inspired thousands of more recent publications, all
of them supplementing, improving and updating his work, just as de Solla Price
had predicted of Needham's series. As for the third of Needham's three sentences, 
its significance depends on the significance of the second. 

Needham's idea is based upon two inscriptions discussed in Coedès's article.
One comes from Cambodia, the other from Indonesia. Both show the zero, one in 
the form of a dot and the other in the shape of a small closed curve that may be 
a circle. Both are dated 683 CE, almost two centuries earlier than the alleged first 
Indic inscription of 870. 

Needham does not comment on the puzzling circle, if it is a circle, which comes 
from the small Indonesian island of Banka, but he stumbles on both inscriptions 
and then makes his fatal lapse. He assumes throughout that their language is 
Southeast Asian but does not say which language it is. Is he thinking of Khmer or 
Old Javanese? The simple truth is that the language of the inscriptions is expressed 
in a Southeast Asian script but is none other than Sanskrit. 

This was obvious to Coedès as every careful reader of his article can see. He
writes on page 325: “au Cambodge, les premières inscriptions sanskrites datées
font usage des mots symboliques”; “au Champa, les deux plus anciennes inscriptions
sanskrites datées ... en langue sanskrite”; “les inscriptions sanskrites du
Champa”; “à Java, la plus ancienne inscription sanskrite datée fait usage des mots
symboliques” (all italics mine). These quotes show that "the first appearance of
the zero in dated inscriptions on the borderline of the Indian and Chinese culture- 
areas" does not only fail to be a coincidence. They demonstrate that the content of 
these inscriptions is Indic. 

I now come to David Ruegg who wrote a brief but substantial article partly 
concerned with our problem. It is entitled "Mathematical and Linguistic Models 
in Indian Thought: The Case of Zero and Śūnyatā” and was published in 1978.
Śūnyatā refers, as is well known, to the Buddhist concept of ‘emptiness.’ It is
a characteristic feature of the Madhyamaka school and was foreshadowed by a
certain Bhadanta Vasumitra who may have lived at the end of the first or beginning
of the second century CE. Its context is the theory of dharmas, which do not refer
to the Buddhist dharma or ‘doctrine’, but to ‘elements’ or ‘factors’, each of which
is considered to bear its own specific characteristic that determines it, as Ruegg
explains the expression of the Abhidharmakośa: ‘svalakṣaṇadhāraṇād dharmaḥ’.

I shall not further some readers’ possible annoyance with Vasumitra’s Sanskrit 
but here is Ruegg’s translation: “A dharma evolving in the [three] times is stated 
to be other according to the different states it enters, [the change in question being 
then] due to otherness of state (avasthantaratah) but not of substance". The words 
and phrases within square brackets are due to Ruegg, who explains the example 
that Vasumitra adds as follows: “like a marker or counter (vartika) in reckoning 
which in the unit position has the value of a unit, in the hundred’s position that of 
a hundred, and in the thousand's position that of a thousand"—a straightforward 
expression of the use of zero as a place-marker in the decimal system. 

Ruegg adds that the same idea is sometimes expressed by the term gulika, “ball” 
or "bead" which, like the counting boards to which they belong, should remind us 
of the fact that mathematicians are not always concerned with what modern readers 
think of almost exclusively, viz. writing. Needham is familiar with this usage of 
śūnya because he compares it to the empty spaces on Chinese counterboards in a
long footnote on his pages 11-12. 
Ruegg discusses two terms for zero: kha, which we have already met, and
bindu, which means "dot". He draws attention to the Vāsavadattā, a literary work
of the sixth century by Subandhu which uses śūnyabindu to denote the symbol for
zero. He refers next to Pingree's work on the Yavanajātaka of Sphujidhvaja, then
about to be published. That text was composed in 149/150 CE and used the term 
bindu in “the earliest reference known to the decimal place-value system with a 
symbol for zero in India" (Pingree 1978, II: 406, I: 494). 

The next topic Ruegg considers is the history of the term śūnya. He starts
with the Rigveda which employs śūna for "lack, absence, emptiness.” (One of
the earliest examples is "lack of sons" in RV 7.1.11.) Later Vedic has śūnya in
the meaning “hollow, deserted”. After providing more information on śūnyatā,
he concludes cautiously that we cannot trace connections between the Buddhist 
"emptiness" and the mathematical concept of zero. 

Ruegg then turns to early linguistics and Panini's lopa. We shall look at its
apparent invisibility in the next section. Ruegg notes its occurrence in modern
linguistics but then diverges from our topic in grammatical and philosophical
directions that involve śūnyatā without throwing light on the zero. His cautious
peregrinations have inspired my own meanderings. The reader should note what 
has not been shown and remember Pingree's statement: the earliest written refer- 
ence to the decimal place-value system with a symbol for zero in India is dated to 
149/150 CE. We have to look not only beyond writing but further.


3. Zeroes in Sanskrit Grammar 


Almost all Indic mathematicians wrote in Sanskrit, the classical language of sci- 
ence that unites the subcontinent (Staal 1995). Malayalam is among the famous 
exceptions (Divakaran 2007). Persian and English became more common in pre- 
modern and modern times but Sanskrit continued in mathematics and Jyotihsastra 
or astronomy-cum-astrology (see, e.g., Minkowski 2002, 2008). Many of the classics
of Indian mathematics were composed in concise and sometimes elegant Sanskrit
verse. Here is young Aryabhata on the subdivisions of time:

"A solar year is a year of men. Thirty of them make an ancestral year. Multiplied
by twelve is explained as a year for the gods" (ravivarṣaṃ mānuṣyaṃ
tad api triṃśadguṇaṃ bhavati pitryam | pitryaṃ dvādaśaguṇitaṃ divyaṃ
varṣaṃ samuddiṣṭam: 3:7). Another couple of lines condenses the full sine table
in one couplet (1:12). And who does not know the penultimate verse: "From the
ocean of true and false knowledge I have, through the boat of my mind,
rescued with the grace of the deity the precious sunken jewel of true knowledge"
(sadasajjñānasamudrāt samuddhṛtaṃ devatāprasādena | sajjñānottamaratnaṃ
mayā nimagnaṃ svamatināvā).

How did these mathematicians know Sanskrit? It could not have been their
first or native language. Many were not brahmans (Aryabhata was not or else
his name would have been Aryabhatta¹); and neither need all brahmans know
Sanskrit. Arya, moreover, does not refer to “the three twice-born classes” as it 
probably did in the Manusmṛti, “The Law Code of Manu”, 2.207 (Olivelle 2004:
242). The distinction between ārya and anārya has not been a racial or ethnic
distinction. Madhav Deshpande has shown that it expresses claims to moral, social 
and spiritual status, tending toward exclusion in so-called ‘Hindu’ legal texts and 
epics, but inclusion and transformation among Jainas and Buddhists (Deshpande 
1999). 

The mathematicians who wrote in Sanskrit, then, might have come from any- 
where and their native language may be anyone’s guess but they must have studied 
Sanskrit grammar. It is unlikely that they studied books. In India, no pandita or 
traditional scholar does. An aspiring savant may have been taught by his father or 
must have had a teacher, who had his own guru, etc. in the oral succession of gu- 
ruparampara, “the lineage of teachers". All that knowledge must ultimately have 
come from one of the many existing and surviving Sanskrit grammatical works. 
Two questions arise: which one did it come from and what did it say? 

Like other scholars, many mathematicians are likely to have studied the earli- 
est and most famous Sanskrit grammar: that of Panini of the fifth/fourth century 
BCE, or its later adaptations such as the Kāśikā of the seventh century CE or the
Paniniyan grammar of Bhattoji Diksita of the seventeenth. Buddhists had their
own grammars due to famous masters such as Candragomin of the fifth century 
CE, just as the Jainas had great grammarians from Devanandin (fifth c. CE) to 
Hemachandra (twelfth) and beyond (for more information on the Sanskrit gram- 
marians see Staal 1972). All these works were inspired, directly or indirectly, by 
the Paninian tradition. And all of them possessed not one but many zeroes. 

What is zero in grammar or linguistics? Panini had a technical term for it:
lopa. He defined it as “something that does not appear” (adarśanaṃ lopaḥ 1.1.60).
It is not a rare term in his grammar. Its “non-appearance” (adarśanam) does not
prevent it from occurring in forty-five out of four thousand rules, if I counted them
correctly as they were enumerated by Böhtlingk in his edition (1887, with many
reprints: II: 271*). The actual number is higher since I have not taken account of
Böhtlingk's uses of the expression fgg which indicates “and following”.

Professor P.P. Divakaran, who commented on an earlier draft of this article, 
was intrigued by the definition of lopa as adarśanam because Panini certainly
lived before Aśoka who presumably introduced writing in the third century BCE
and: “I should think that a sound which is absent would be characterised by Panini 
as unheard or unsounded rather than as unseen”. 


¹ The philologist’s concept of lectio difficilior, “the more difficult reading,” should be extended so
as to be applicable to the oral transmission of compositions: the more difficult and unexpected of two 
readings or orally transmitted forms, viz., Aryabhata, must be preferred to the normal, expected form 
Aryabhatta, since it is unlikely that it is based upon an error. 
This is an apt observation but there is more to say. Some interpreters have indeed
translated Panini’s adarśanam as ‘unseen’ since the verbal root drs- certainly
means “to see". I translated it differently as “something that does not appear" like 
others have done because that same root is widely used in the much wider mean- 
ing of seeing with one’s mind. It includes perception, observation, appearance, 
knowing, etc., and is a common meaning in Sanskrit and similarly in other lan- 
guages. In Indian philosophy, the six traditional systems are called darsana but 
their epistemology is not confined to seeing with one’s eyes. In English we say: 
"I see what you mean”. In later Sanskrit, na drsyate means: “it does not appear 
(that such-and-such is the case)". In English the same ambiguity applies to appear 
itself: "there appeared a large bird on the roof" suggests that the bird was actually 
seen but in "there appears to be much confusion about the PM of Thailand stepping
down", nothing may have been perceived by eyes or eyes only; it is a topic
that people are talking about. 

According to Renou's dictionary of grammatical terms in Sanskrit (Renou 1957 
under adarsana, lopa and lup-), adarsana means amuïssement, a technical term
in French which expresses that a phoneme is dropped in pronunciation. Renou
refers to Katyayana, the grammarian who commented on Panini and lived some two
or three centuries later, probably during the reign of Aśoka (a fuller but somewhat
opaque discussion occurs in Cardona 1976, 1980, pages 267-273). That
date could support an interpretation that refers to writing, but Katyayana himself
does not see it that way. Whether it was written or not, his statement is startling:
adarsana means not seen, not heard, not pronounced, not perceived, absence or 
disappearance. It calls for a closer analysis and the entire subject seems to stand 
in need of a new and thorough revision but I think that, in the present context, 
we may safely conclude that drs- does not only mean “to see with one's eyes", 
that adarsanam does not only mean “unseen” and that “something that does not 
appear” is the best translation—for the time being. 

Mathematicians studying Sanskrit in order to be able to compose works with
all-India appeal could not have missed the numerous zeroes that occur in Sanskrit
grammars. Modern readers are able to understand them provided they know something
of the morphology of words in Sanskrit. It is found in other Indo-European
languages such as English, and in others. The examples that follow below under
(1)-(3) occurred in three papers by Pandit: 1962, undated and 1990 (not seen), all 
of which, like Allen 1955, suffer from the complex constructs of outdated linguis- 
tic systems. No such defects disfigure Shefts 1961 who treated (2) and (3) and was 
reviewed in Staal 1963a (reprinted in 1988: 232-237). 

Before we begin I must mention that Sanskrit does not use an explicit pronoun 
like English. The verbal form khadati does not mean: “eats” but “he eats". That 
“he” disappears when there is a subject as in ramo khadati which means “Rama 
eats” and not *“Rama he eats” (where the asterisk expresses ungrammaticality). What
is important in our context is that Sanskrit distinguishes like English between the 
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his name would have been Aryabhatta!); and neither need all brahmans know 
Sanskrit. Arya, moreover, does not refer to “the three twice-born classes” as it 
probably did in the Manusmrti, “The Law Code of Manu", 2.207 (Olivelle 2004: 
242). The distinction between drya and anàrya has not been a racial or ethnic 
distinction. Madhav Deshpande has shown that it expresses claims to moral, social 
and spiritual status, tending toward exclusion in so-called *Hindu' legal texts and 
epics, but inclusion and transformation among Jainas and Buddhists (Deshpande 
1999). 

The mathematicians who wrote in Sanskrit, then, might have come from any- 
where and their native language may be anyone’s guess but they must have studied 
Sanskrit grammar. It is unlikely that they studied books. In India, no pandita or 
traditional scholar does. An aspiring savant may have been taught by his father or 
must have had a teacher, who had his own guru, etc. in the oral succession of
guruparampara, “the lineage of teachers”. All that knowledge must ultimately have
come from one of the many existing and surviving Sanskrit grammatical works. 
Two questions arise: which one did it come from and what did it say? 

Like other scholars, many mathematicians are likely to have studied the earli- 
est and most famous Sanskrit grammar: that of Panini of the fifth/fourth century 
BCE, or its later adaptations such as the Kasika of the seventh century CE or the 
Paninian grammar of Bhattoji Diksita of the seventeenth. Buddhists had their
own grammars due to famous masters such as Candragomin of the fifth century 
CE, just as the Jainas had great grammarians from Devanandin (fifth c. CE) to 
Hemachandra (twelfth) and beyond (for more information on the Sanskrit gram- 
marians see Staal 1972). All these works were inspired, directly or indirectly, by 
the Paninian tradition. And all of them possessed not one but many zeroes. 

What is zero in grammar or linguistics? Panini had a technical term for it:
lopa. He defined it as “something that does not appear” (adarsanam lopah 1.1.60).
It is not a rare term in his grammar. Its “non-appearance” (adarsanam) does not
prevent it from occurring in forty-five out of four thousand rules, if I counted them
correctly as they were enumerated by Böhtlingk in his edition (1887, with many
reprints: II: 271*). The actual number is higher since I have not taken account of
Böhtlingk’s uses of the expression fgg., which indicates “and following”.

Professor P.P. Divakaran, who commented on an earlier draft of this article,
was intrigued by the definition of lopa as adarsanam because Panini certainly
lived before Asoka, who presumably introduced writing in the third century BCE:
“I should think that a sound which is absent would be characterised by Panini
as unheard or unsounded rather than as unseen”.


1 The philologist’s concept of lectio difficilior, “the more difficult reading,” should be extended so
as to be applicable to the oral transmission of compositions: the more difficult and unexpected of two
readings or orally transmitted forms, viz., Aryabhata, must be preferred to the normal, expected form
Aryabhatta, since it is unlikely that it is based upon an error.

This is an apt observation but there is more to say. Some interpreters have indeed
translated Panini’s adarsanam as “unseen” since the verbal root drs- certainly
means “to see”. I translated it differently as “something that does not appear”, as
others have done, because that same root is widely used in the much wider meaning
of seeing with one’s mind. It includes perception, observation, appearance,
knowing, etc., and is a common meaning in Sanskrit and similarly in other languages.
In Indian philosophy, the six traditional systems are called darsana but
their epistemology is not confined to seeing with one’s eyes. In English we say:
“I see what you mean”. In later Sanskrit, na drsyate means: “it does not appear
(that such-and-such is the case)”. In English the same ambiguity applies to appear
itself: “there appeared a large bird on the roof” suggests that the bird was actually
seen but in “there appears to be much confusion about the PM of Thailand stepping
down”, nothing may have been perceived by eyes or eyes only; it is a topic
that people are talking about.

According to Renou’s dictionary of grammatical terms in Sanskrit (Renou 1957
under adarsana, lopa and lup-), adarsana means amuissement, a technical term
in French which expresses that a phoneme is dropped in pronunciation. Renou
refers to Katyayana, a grammarian who commented on Panini and lived some two
or three centuries later, probably during the reign of Asoka (a fuller but somewhat
opaque discussion occurs in Cardona: 1976, 1980, pages 267-273). That
date could support an interpretation that refers to writing, but Katyayana himself
does not see it that way. Whether it was written or not, his statement is startling:
adarsana means not seen, not heard, not pronounced, not perceived, absence or
disappearance. It calls for a closer analysis and the entire subject seems to stand
in need of a new and thorough revision but I think that, in the present context,
we may safely conclude that drs- does not only mean “to see with one’s eyes”,
that adarsanam does not only mean “unseen” and that “something that does not
appear” is the best translation, for the time being.

Mathematicians studying Sanskrit in order to be able to compose works with 
all-India appeal, could not have missed the numerous zeroes that occur in Sanskrit 
grammars. Modern readers are able to understand them provided they know some- 
thing of the morphology of words in Sanskrit. It is found in other Indo-European 
languages such as English; and in others. The examples that follow below under 
(1)-(3) occurred in three papers by Pandit: 1962, undated and 1990 (not seen), all 
of which, like Allen 1955, suffer from the complex constructs of outdated linguis- 
tic systems. No such defects disfigure Shefts 1961 who treated (2) and (3) and was 
reviewed in Staal 1963a (reprinted in 1988: 232-237). 

Before we begin I must mention that Sanskrit does not use an explicit pronoun
as English does. The verbal form khadati does not mean “eats” but “he eats”. That
“he” disappears when there is a subject, as in ramo khadati, which means “Rama
eats” and not *“Rama he eats” (where the asterisk expresses ungrammaticality). What
is important in our context is that Sanskrit distinguishes, like English, between the
stem and the suffix or ending of a verb. From the stem khad- (“eat-”) and the
ending -ti (“-s”) it forms:


khad-a-ti “(he) eat-s” (1)


What is -a-? It is something in between which we may call an infix. I put hyphens
between the three elements in the Sanskrit expression to distinguish the stem and
the ending, which we find in both Sanskrit and English, and to highlight the infix,
which we only find in Sanskrit.

There are, however, various classes of verbs in Sanskrit. One of them has a 
verb with the same meaning but lacks the infix: 


ad-ti “(he) eat-s” (2)


which becomes atti, owing to what is sometimes called “assimilation”.
Panini's grammar is a list of rules (sūtra). (1) follows from a general rule. (2) 
seems to illustrate a special case. However, (1) and (2) express similar properties 
and the underlying rule is the same if we adopt a principle called laghava, literally 
“lightness”. (2) is now expressed as: 


ad-0-ti “(he) eat-s” (3)


The symbol “0”, which indicates absence of the infix, is the lopa defined as “something
that does not appear”. Here we meet the grammatical or linguistic zero. It
occurs not only in verbs but also in nouns and it should be obvious that there are 
many of them. 

Panini's rules are generally ordered and he could have reversed the order, i.e., 
start with (2) and derive (1) by inserting the infix. Such problems are discussed 
by commentators and other grammarians, including Joshi and Kiparsky 1979 and 
Kiparsky 1991 who have shown that “lightness” is not simply an abbreviation but 
expresses generalization. It explains the famous saying: “grammarians rejoice 
over the saving of half a syllable as over the birth of a son". My examples do no 
more than illustrate the thesis, that the many linguistic zeroes of Sanskrit grammar 
led mathematicians to muse about one or more mathematical zeroes. 

Indic mathematicians did not only study Panini to compose works with all-India
appeal. They were a small group of people, not popular or prestigious like
mathematicians in the modern world. In India there was only one “science of
the sciences” (sastranam sastram), the boundless (anantaparam), the summit
of learning: grammar or vyakarana, which literally means “analysis”. Mathematicians
were flattered to be associated with such a prestigious tradition. In the
modern world, the opposite holds. Grammar is not a popular subject and many
scientists do not even know what “linguistics” means. It developed as a serious
discipline only after the discovery of Sanskrit by William Jones, Charles Wilkins’
Sanskrit grammar and Franz Bopp’s adoption of the methods of the Indian grammarians
in his “Systems of Conjugation in Sanskrit compared with those in the
Greek, Latin, Persian and Germanic Languages” of 1816 (see Thieme 1982/83,
Staal 1989 and 1993, Chapters 4 and 5, and 1995: 109). 

The histories of linguistics, logic and the theory of ritual in India and Europe 
were first compared in Staal 1988: 36-45. It presented graphs of developments 
that gave a rough idea of the ups and downs. Logic included mathematical logic, 
but I did not then, and would not now, be able to include mathematics. “The
Theory of Definition in Indian Logic” (1961, reprinted in 1988: 90) referred to
the occurrence of a kind of null-class presupposed in “modern” works such as
the Tarka-samgraha of the seventeenth century AD. But in India, linguistics and
logic were closely connected almost from the beginning. 

Panini distinguished different zeroes from each other by making use of a rich 
conceptual apparatus. He was aware of the fact that the language of his grammar 
was modeled in part after the language that was the object of his study: Sanskrit. 
It could lead to confusion unless the two were clearly distinguished. He therefore 
made a distinction between “rules” (sūtra) and “metarules” (paribhasa). Implicitly
working with what we would now call a metalanguage, Panini made explicit
use of meta-linguistic markers which he called it. To distinguish the different zeroes
from each other, he made use of the fact that lopa comes from a verbal root
that starts with “l” and to which we shall return. The meta-linguistic markers always
have an “l” that marks them as dealing with zeroes such as luk, lat, lit and
slu, each defined for particular classes or special cases. In modern transliterations,
they are indicated by capitals (which Sanskrit has no means of distinguishing from
small letters). The uses of metalanguage in Sanskrit grammar have been studied
by Scharfe 1961 and Staal (1963b, 2003: 353-6); rules and metarules are studied
separately in Staal forthcoming.

English grammar does not use such meta-linguistic markers but it could do 
something similar. It may be illustrated with the help of a rough sketch of English 
noun pluralization (a formalized grammar of such a topic may look quite different 
and require a substantial book). We shall begin with a general rule, where P is the 
plural marker: 


noun + P > noun + suffix (e)s 


This is a context-sensitive rule in which > stands for “is replaced by”; + stands 
for concatenation; and parentheses express options that distinguish dogs from 
witches. The general rule as stated does not account for fish or sheep which require 
a zero-suffix. 

My account, so far, applies to written English. It does not explain different
pronunciations of the written s, which may sound like “s” or “z”. If we try to
account for pluralization in both written and spoken English we need a greater
variety of expressions. I shall not belabor the point but Panini’s way with
meta-linguistic zero-suffixes for special cases may be illustrated again for English by
using subscripts as linguistic markers, for example:


man + 0₁ > men
woman + 0₂ > women
mouse + 0₃ > mice.
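
The rules sketched so far can be put into a few lines of Python as a rough illustration. Everything in it is an assumption made for the sake of the example: the word lists, the marker labels `0_1` to `0_3`, the function name `pluralize`, and the crude spelling test that chooses between `s` and `es`. It is not a formal grammar of English.

```python
# Hypothetical lexicon: nouns that take a subscripted zero-marker, with the
# stem change that the marker triggers (illustrative entries only).
MARKED_ZEROES = {
    "man": ("0_1", "men"),
    "woman": ("0_2", "women"),
    "mouse": ("0_3", "mice"),
}

# Nouns whose plural is formed with a plain zero-suffix (illustrative).
ZERO_SUFFIX = {"fish", "sheep"}


def pluralize(noun):
    """Rewrite noun + P according to the sketch: special cases first."""
    if noun in MARKED_ZEROES:
        _marker, plural = MARKED_ZEROES[noun]
        return plural
    if noun in ZERO_SUFFIX:
        return noun  # zero-suffix: nothing appears
    # General rule noun + P > noun + (e)s, with a crude test for the "e".
    if noun.endswith(("s", "x", "z", "ch", "sh")):
        return noun + "es"
    return noun + "s"


for word in ["dog", "witch", "fish", "sheep", "man", "woman", "mouse"]:
    print(word, ">", pluralize(word))
```

The special cases are tested before the general rule, which mirrors the way a marked zero overrides the default suffix in the sketch.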


The spelling of English is idiosyncratic but that of Sanskrit, in that respect 
closer to Italian or Spanish, is rational. It is adopted by all Indic syllabaries in 
South and Southeast Asia and in the Roman transliteration adopted by Sanskrit 
scholars worldwide. That transliteration writes the ou of English mouse as au and 
the i of mice as ai. They are part of an extended system with similar sound corre- 
spondences in Sanskrit and Indo-European. Sanskrit derives from nouns such as 
Siva the adjective Saiva which, in English, became "Shivaite". Similarly, the noun 
rudra produces the adjective raudra to which no English adjective corresponds. 


4. Zeroes in the Srauta Ritual 


I shall end our discussion with the Vedic Srauta ritual which belongs to the pre- 
history of zero as well as that of Sanskrit Grammar. Vedic ritual is, therefore, a 
parent as well as a grandparent of zero. All forms of Vedic ritual are concerned 
with recitations, chants and acts. Recitations come from the Rigveda, chants from 
the Samaveda and ritual acts are the chief concern of the Yajurveda. Vedic ritual 
became a science or theory (Staal, forthcoming) in one of its later forms which de- 
veloped between roughly the tenth and seventh centuries BCE and became known 
as the Srauta ritual. It was an oral tradition. 

Srauta ritual was the most creative of Vedic rituals. By that time, practice 
and theory had become highly developed disciplines and were closely connected 
with each other. That development did not take place among other higher animals 
which perform rituals but do not have theories because they do not possess lan- 
guage. Human ritualists are able to talk about ritual with each other, introduce 
modifications and add refinements. They started thinking ritually; but it did not 
happen often. Srauta ritualists may have been the first. Japanese theologians in- 
fluenced their rituals during recent centuries (Sharf 2003). Modern students have 
their own rituals but remain a motley crowd consisting of theologians, psycholo- 
gists, representatives of disjointed human and social sciences while ritualization 
among the higher animals, again including the human, continues to be taken care 
of by life sciences on the borderline. Many of these specialists are innocent of
theory. It explains an important difference between sections 3 and 4.
Sanskrit grammar deals with Sanskrit and influenced modern linguistics. The
Srauta ritual did not influence any modern theory of ritual because there is no 
such theory. It implies that we cannot adopt Needham’s rule as formulated on page 
3 above: we cannot do "the only thing we can do" because there is no yardstick. 
The reader will accordingly meet with unfamiliar concepts, methods and modes of 
analysis. Some are discussed in Chapters 7, 12 and 13 with their Source Notes in 
Staal 2008. But unless we are acquainted with one of the few surviving traditions 
of Srauta ritual and/or some of the literature in Sanskrit and in modern languages 
about Vedic ritual, we shall be on our own. 

Basic to any ritual performance are space and the four directions. The rit- 
ual arena consists of several sheds with thatched roofs that are temporarily con- 
structed for each performance. Some of the most important Srauta ceremonies 
are performed in a small space at the center that is called the Sadas. The word 
is derived from the Sanskrit verb sad- or “sit” which occurs in the contemporary 
term upanisad, "sitting close to" (the teacher). 

The Sadas looks as follows with the north on top: 


RIGVEDA > | < YAJURVEDA


Vedas are recited in the four directions which the reciters themselves must also
face. The above sketch makes use of two directions that are indicated in the figure
by symbols, one of which we have used before but which now has a new meaning: > means
“facing east” and < means “facing west”. These directions raise a host of technical
and theoretical problems, some of them discussed by Caland and Henry (1906-7:
232) and Keith (1914: I, 252 note 4). I shall mention two. The first is concerned
with the directions only. The second combines directions with the verbal root from
which lopa derives.

The first case is illustrated by the sketch. It depicts a change of directions that 
has just taken place. It does not involve the Rigveda which is recited inside the 
Sadas by an officiant who is already sitting there, facing and reciting east. I shall 
call him R. The Yajurveda is recited by a priest I shall call Y, but he has come from 
outside the Sadas and cannot easily enter. He has made several turns already and 
will eventually face west and face R: the auspicious result that is depicted here. 
Earlier, the two officiants did not only fail to face each other but Y sat with his back 
to R - a situation that is to be avoided at all costs. The entire episode illustrates 
how the rivalry between different factions may be resolved and overcome.
The stage is now set for the second case. A dialogue unfolds which is initiated
by R who recites a proposal:


“Let us both recite!” (somsavo)


Y responds from outside the Sadas with a touch of flattery. His verse consists
of two halves:


“Let us both recite, divine one!” (somsavo daiva)


“Recite! Let us both rejoice!” (samsa madeva) 


Suppose Y were to omit the second half-verse. I have never witnessed it but it 
must have happened often because the two halves are very similar. The problem 
is addressed by the Yajurveda: “if the response after the half-verse were omitted
(lupyeta) it would be like someone being left behind by others who are running
ahead” (Taittiriya Samhita 3.2.9.5). Here we have an instance of the verbal root
lup- from which lopa is derived. The general meaning of the verb is “disappear” 
or "get lost". 

Louis Renou, whom I mentioned before, was the first to draw attention to the
numerous ritual uses of lup- in his 1941-1942 study on the connections between
Sanskrit grammar and Vedic ritual (465, note 83). It is also Renou, “the most complete
Sanskritist” as he was called by V. Raghavan (1956: 20), who first demonstrated
the historical precedence of ritual over grammar.

I conclude that ancient India reverberated with zeroes, zero entities and zero
events long before the geometry of the Sulva Sutras, which are post-Buddhist.
What may be called the prehistory of zero was expressed in early Vedic by kha
which refers to cavities of various sorts and occurs in the Upanisads in the sense
of “space”. The Srauta Sutras, late Vedic but pre-Buddhist, used lopa to refer to
omissions, disappearances and things that are lost. It is here that the origins of the 
mathematical concept of zero seem to lie. We do not know where it happened if it 
happened only once, but the most likely place would be the Kuru region north of 
modern Delhi though it may have been further east in Magadha, which overlaps 
with modern Bihar. The time must have been after 1,000 and before 600 BCE when 
the creative period of the Srauta ritual was over. It is a long period with smudgy 
edges but there it is. 

Reverting finally to arithmetic, there are important questions that I have not so
far considered. How was zero conceived as a member of the number series? Takao
Hayashi has suggested that it may be related to additions such as 15 + 20 = 35 (in
modern symbols) which presuppose 5 + 0 = 5. That is plausible enough, but when did it
happen? Or could zero be related to the recursive principle underlying the decimal
number names (and hence their construction) which, according to P.P. Divakaran, 
was perfectly well understood in the early Rigveda? Plausible also, but would it 
imply that zero was conceived as the beginning of the infinite series of natural
numbers, so that one would count 0, 1, 2, 3 etc.?

Let us return once more to our explanation of the origin of the mathematical
zero in terms of the assignment of special meanings to Sanskrit terms such as lopa,
sunya or bindu. We are fortunate to possess the records of such events. Similar
words occur in other human languages but did not undergo a similar development
as far as I know. In their Indic evolution, lopa was inspired by ritual but is that a
necessary part of its prehistory? All we know is that it added the flesh of another 
empirical discipline to the bones of linguistics, the discipline that underlies our 
understanding of the development of language. As far as I can tell, thus far, two 
conditions must be satisfied before a concept of zero may arise: there needs to be 
a language as well as another formal structure in which that language is used to 
signify that something has disappeared or is lost.


Combinatorial Methods
in Indian Music: Pratyayas
in Sangitaratnakara of Sarngadeva


Raja Sridharan, R. Sridharan and M. D. Srinivas 


1. Introduction 


Six combinatorial tools (called pratyayas) have been in systematic use in India for
the study of Sanskrit prosody (Chandas-sastra) and these go back in time at least
to Pingala (c. 300 BC?). Among these, three are found in Bharata’s Natyasastra,
in the chapter where prosody is discussed: prastara (an enumeration rule for
generating all the possible metrical patterns of a given class as a sequence of rows),
uddista (the process for finding, for any given metrical pattern, the corresponding
row number in the prastara) and nasta (the converse of uddista). Incidentally,
the relative chronology of Pingala and Bharata still remains
an unsettled question. The notion of pratyayas was perhaps discussed in
other ancient texts of music also. However, the first extant text on music where the
pratyayas are systematically dealt with, both in connection with patterns of musical
phrases (tanas) and patterns of musical rhythms (talas), is Sangitaratnakara
of Sarngadeva (c. 1225 AD). Narayana Pandita in his Ganitakaumudi (1356
AD) deals with some of these questions in a more general context, though his
theory does not cover the kind of tala-prastara considered by Sarngadeva.

Our aim in this article is to highlight the contributions of Sarngadeva and explain
his work in a mathematical setting. We first discuss the sequential generation
or enumeration of patterns of musical phrases, called tana-prastara. The method
of generating these patterns, as discussed in the first chapter of Sangitaratnakara,
is essentially a rule for generating sequentially the n! permutations of n symbols.
We note that the prastara, and the nasta and uddista processes are all indeed
encoded in a certain unique representation of any integer in terms of sums of factorials.
We also explain how Sarngadeva employs a tabular figure, khanda-meru,
to essentially go back and forth between any integer and its representation as a
sum of factorials.

We then move on to discuss the pratyayas for patterns of musical rhythms,
tala-prastara. This theory has been dealt with at length in the sixth chapter of
Sangitaratnakara. It is in fact a generalisation of the theory of pratyayas for
moric metres or matra-vrttas, where the short syllable (laghu) is taken to be of
one matra (metrical time unit) and the long syllable (guru) is taken to be of two
matras. Sangitaratnakara considers musical rhythmic patterns (talas) made up
of druta (of one time unit) and laghu, guru and pluta, which are of 2, 4 and 6 times
the duration of a druta respectively.

It should be noted that the notion of pluta, viewed as a prolated vowel, considered
to be of the same duration as three laghus, appears already in ancient Sanskrit
prosody. In fact the notion of pluta occurs in the Rgveda only three times, though
much more frequently in other Samhitas and Brahmanas. In classical Sanskrit,
a pure consonant sound called a vyanjana (without any vowel attached) is said to
be of half a matra. A laghu is said to be of one matra, a guru of two matras and
a pluta of three matras. Clearly, as it is clumsy to handle fractions in
music, time units of 1, 2, 4 and 6 were perhaps introduced and called druta, laghu,
guru and pluta respectively.

Sangitaratnakara first presents a systematic method of enumerating all the
talas of a given time duration in a prastara, and follows this up with a complete
mathematical theory of pratyayas which is a generalisation of the corresponding
theory for moric metres. An interesting feature of tala-prastara is that the total
number of patterns (the samkhyanka), if laid out in a sequence, has a generating
function which involves a polynomial of the sixth degree. This is an analogue of
the notion of samkhyas for the moric metres, which are given by the so-called
Fibonacci numbers (a result which is already present in the seventh century Prakrta
text on prosody, Vrttajatisamuccaya of Virahanka), which have as their generating
function the inverse of the quadratic polynomial $(1 - x - x^2)$. In fact the
sequence of samkhyankas associated with tala-prastaras satisfies a more complex
(four term) recurrence relation and has the inverse of the sixth degree polynomial
$(1 - x - x^2 - x^4 - x^6)$ as its generating function. Based on this generating
function, we can compute the number of tala patterns in the prastara which have
a given number of drutas, laghus, gurus or plutas. This is an analogue of the
pratyaya known as ekadvayadi-lagakriya in Sanskrit prosody, and is discussed
in Sangitaratnakara for the tala-prastara by introducing various tabular figures
called merus, which are constructed on the basis of systematic recurrence relations
which can be derived from the generating function mentioned above.
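
The four-term recurrence implied by this generating function can be checked numerically. The following is a minimal sketch under stated assumptions: we take the number of patterns of total duration zero to be 1 (the empty pattern), and the function name `count_patterns` and its parameters are hypothetical names chosen for illustration. A pattern of total duration n ends in a unit of duration d, so the count for n is the sum of the counts for n - d over the allowed durations d.

```python
def count_patterns(n_max, durations):
    """Number of ordered patterns of each total duration 0..n_max.

    counts[n] = sum of counts[n - d] over the allowed unit durations d,
    with counts[0] = 1 for the empty pattern (a convention assumed here).
    """
    counts = [1]
    for n in range(1, n_max + 1):
        counts.append(sum(counts[n - d] for d in durations if n - d >= 0))
    return counts


# Moric metres: laghu = 1 matra, guru = 2 matras (Fibonacci numbers).
print(count_patterns(6, (1, 2)))        # [1, 1, 2, 3, 5, 8, 13]

# Talas: druta = 1, laghu = 2, guru = 4, pluta = 6 druta units.
print(count_patterns(8, (1, 2, 4, 6)))  # [1, 1, 2, 3, 6, 10, 19, 33, 60]
```

The first call corresponds to the generating function $1/(1 - x - x^2)$ and the second to $1/(1 - x - x^2 - x^4 - x^6)$.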

The discussion of pratyayas in prosody and music leads to the study of combinatorics
related to three important ways of representing any non-negative integer,
representations which are widely in use even today.¹ While discussing the
pratyayas for varna-vrttas, Pingala gave the procedure for finding the binary
representation of integers. Much later, Narayana Pandita, in his Ganitakaumudi


1 See for instance, A. S. Fraenkel, Systems of numeration, Amer. Math. Monthly 92 (1985), 105-
114.

(c. 1356 AD), generalised the method of Pingala for an arbitrary radix. The discussion
of pratyayas for matra-vrttas led to the discovery of the so-called Fibonacci
representation of integers, which expresses any integer uniquely as a suitable
sum of Fibonacci numbers. In his discussion of tala-prastara, Sarngadeva
introduced a generalization of the Fibonacci representation, where the Fibonacci
numbers are replaced by the samkhyankas of tala-prastaras. Another form of
generalization was considered later by Narayana Pandita. As we already noted,
Sarngadeva, in his construction of khanda-meru, implicitly employed the so-called
factorial representation of integers in his discussion of the pratyayas for
tana-prastara.
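
To make the last of these representations concrete, here is a minimal sketch of the factorial representation in one standard modern convention, which is an assumption on our part and not necessarily the form in which it appears in the khanda-meru: every non-negative integer m is written uniquely as m = d_1·1! + d_2·2! + d_3·3! + ... with 0 ≤ d_i ≤ i. The function names are hypothetical.

```python
from math import factorial


def to_factorial_digits(m):
    """Digits [d_1, d_2, ...] with m = sum of d_i * i! and 0 <= d_i <= i."""
    digits = []
    base = 2
    while m > 0:
        m, d = divmod(m, base)  # successive division by 2, 3, 4, ...
        digits.append(d)
        base += 1
    return digits


def from_factorial_digits(digits):
    """Inverse of to_factorial_digits."""
    return sum(d * factorial(i) for i, d in enumerate(digits, start=1))


print(to_factorial_digits(23))           # [1, 2, 3]: 1*1! + 2*2! + 3*3! = 23
print(from_factorial_digits([1, 2, 3]))  # 23
```

Since d_1·1! + ... + d_k·k! is at most (k+1)! - 1, k digits suffice for exactly the (k+1)! integers from 0 to (k+1)! - 1, which is the same as the number of permutations of k+1 symbols.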


2. Tana-Prastara 


Sangitaratnakara of Sarngadeva (c.1225) is one of the seminal texts of Indian 
music. It acts as a bridge between the ancient texts like Natya-Sastra, Brhaddesi 
and Dattilam and all the major later texts. As Sarngadeva states, his family 
originally came from Kashmir and his grandfather, father and he himself received 
patronage from king Singhana (1210-1247) of the Yadava dynasty ruling from 
Devagiri in Maharashtra. 

Chapter I of Sangitaratnakara is the Svaragatadhyaya. In this Chapter, the
fourth section is entitled Grama-murcchana-krama-tana-prakarana. Of the 91
verses of this section, verses 61-70 deal with the pratyayas associated with
tana-prastara. In the following discussion, we give the textual verses,² which discuss
each of the pratyayas, along with a brief paraphrase³ and necessary mathematical
explanation.

A tana is a sequence of the seven svaras (basic musical tones or notes) which
we denote in their natural order by S, R, G, M, P, D, N. In this section, Sarngadeva
considers the problem of enumeration of all the possible tanas where each
svara appears only once.⁴ For instance, if we consider the four svaras S, R, G, M,


2 We have used the following edition of the work: Sangitaratnakara of Sarngadeva, edited with
Kalanidhi of Kallinatha and Sangitasudhakara of Simhabhupala, by S. Subramanya Sastri,
2nd Ed. revised by S. Sarada, Adyar Library, Madras: Vol. I, 1992; Vol. III, 1986. The excellent
commentaries of Simhabhupala (c. 1350) and Kallinatha (c. 1420) present lucid explanations of
the text along with examples.

3 English translations of the verses on Tanaprastara, in Chapter I (Svaragatadhyaya) of the text,
are available in: Sangitaratnakara of Sarngadeva, edited with English translation by R. K. Shringy
and Prem Lata Sharma, Vol. I, Motilal Banarsidass, Varanasi 1978. English translations of the verses on
Talaprastara, in Chapter V (Taladhyaya) of the text, are available in: Akella Mallikarjuna Sarma,
Tala Prastara of Nissanka Sarngadeva's Sangitaratnakara, Sannidhi Sangita Publications,
Hyderabad 2001.

4 Narayana Pandita in his Ganitakaumudi has generalised the theory of tanaprastara to
include tanas which involve repetition of svaras also.

there are 24 tanas, which represent the various possible permutations of them.
Sarngadeva gives a systematic method of enumerating these permutations in a sequence.
Before discussing his procedure for enumeration of the tanas, Sarngadeva
first states the total number of tanas. This is the first pratyaya, known
as the samkhya, which is to count the number of tanas with a given number of
svaras:


[Sanskrit verses in Devanagari script, illegible in this copy; from Sangitaratnakara,
Chapter I, section 4 (verse numbers illegible)]


The above verses state that the total number of tanas with r svaras is given by
the product $1 \cdot 2 \cdots r$, for $r = 1, 2, \ldots, 7$. This is nothing but the statement that the
total number of permutations that can be formed from r distinct svaras is $r!$.


2.1. The rule for the construction of the Prastara 


[Sanskrit verses in Devanagari script, illegible in this copy; from Sangitaratnakara,
Chapter I, section 4 (verse numbers illegible)]


The purport of the above verses is the following: All the tanas formed from 
a given set of distinct svaras can be enumerated in the form of an array in the 
following manner. In the first row the svaras are written in the ascending order. 
Successive rows in the prastara are generated by the following rule: In each row
of the prastara, starting from the left, identify the first svara which has at least one 
lower svara to the left. Below that svara, place the highest of these lower svaras. 
Bring down all the svaras which are to the right, as they are, to the next row. To 
the left, place all the remaining svaras in the ascending order. This completes the 
next row of the prastara. Continue till the last row is reached where all the svaras 
will be in the descending order. 

In order to illustrate this rule, we may consider the case of four
svaras: S, R, G, M. The first row of the prastara is formed by placing them in
the usual or ascending order SRGM. In order to form the next row, we scan the
first row from the left and identify R as the first svara which has a lower svara
S to the left. This S is placed below R. The G and M, to the right, are brought
down as they are. The remaining svara R is placed to the left, thus completing
the second row as RSGM. To get the third row, the second row is scanned from
the left and G is identified as the first svara which has a lower svara to the left.
Here there are actually two svaras, R and S, which are lower and are to the left of G,
and the higher of them, R, is placed below G. The svara to the right, M, is brought
down. The remaining svaras, S and G, are placed in the ascending order to the
left, thus completing the third row as SGRM. And so on. The entire prastara is
shown in Table 1.
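
The rule just illustrated can be written out as a short program. This is a sketch of our reading of the rule as paraphrased above, not a transcription of the text; the names `SVARA_ORDER`, `next_row` and `prastara` are hypothetical and chosen only for illustration.

```python
SVARA_ORDER = "SRGMPDN"  # natural ascending order of the seven svaras


def next_row(row):
    """Apply the rule once to a list of svaras.

    Returns the next row, or None when the row is fully descending
    (no svara has a lower svara to its left), i.e. the last row.
    """
    rank = SVARA_ORDER.index
    for i in range(1, len(row)):
        # Lower svaras standing to the left of position i.
        lower = [s for s in row[:i] if rank(s) < rank(row[i])]
        if lower:
            placed = max(lower, key=rank)  # highest of the lower svaras
            # Remaining svaras go to the left in ascending order.
            left = sorted((s for s in row[: i + 1] if s != placed), key=rank)
            # Svaras to the right are brought down unchanged.
            return left + [placed] + row[i + 1:]
    return None


def prastara(svaras):
    """All rows of the prastara, starting from the ascending order."""
    row = sorted(svaras, key=SVARA_ORDER.index)
    rows = []
    while row is not None:
        rows.append("".join(row))
        row = next_row(row)
    return rows


rows = prastara("SRGM")
print(len(rows))   # 24
print(rows[:3])    # ['SRGM', 'RSGM', 'SGRM']
print(rows[-1])    # MGRS
```

The first three rows agree with the worked example, and the enumeration stops at the fully descending row MGRS after 4! = 24 rows.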

Sarngadeva's rule for the construction of the prastara can be presented in a
general context by considering the enumeration of the permutations of n distinct
elements with a natural order, which we may assume to be $a_1, a_2, a_3, \ldots, a_n$. The
prastara of these is an $n! \times n$ array of n symbols in n! rows, which will be denoted
by $[a_1, a_2, a_3, \ldots, a_n]$. We will also denote by $[a_i]_r$ the $r \times 1$ column vector with all
the entries given by $a_i$.


 No.   Tana       Permutation
  1    S R G M    a1 a2 a3 a4
  2    R S G M    a2 a1 a3 a4
  3    S G R M    a1 a3 a2 a4
  4    G S R M    a3 a1 a2 a4
  5    R G S M    a2 a3 a1 a4
  6    G R S M    a3 a2 a1 a4
  7    S R M G    a1 a2 a4 a3
  8    R S M G    a2 a1 a4 a3
  9    S M R G    a1 a4 a2 a3
 10    M S R G    a4 a1 a2 a3
 11    R M S G    a2 a4 a1 a3
 12    M R S G    a4 a2 a1 a3
 13    S G M R    a1 a3 a4 a2
 14    G S M R    a3 a1 a4 a2
 15    S M G R    a1 a4 a3 a2
 16    M S G R    a4 a1 a3 a2
 17    G M S R    a3 a4 a1 a2
 18    M G S R    a4 a3 a1 a2
 19    R G M S    a2 a3 a4 a1
 20    G R M S    a3 a2 a4 a1
 21    R M G S    a2 a4 a3 a1
 22    M R G S    a4 a2 a3 a1
 23    G M R S    a3 a4 a2 a1
 24    M G R S    a4 a3 a2 a1


TABLE 1. Tana-prastara of SRGM. 


We explain the rule of enumeration by an inductive procedure. To start with, 
the prastara $[a_1]$ of one element is simply the $1 \times 1$ column $[a_1]_1$. The prastara 
$[a_1, a_2]$ of two elements is the $2 \times 2$ array 

$$\begin{array}{ll} a_1 & a_2 \\ a_2 & a_1 \end{array}$$

which can also be written as 

$$\begin{array}{ll} [a_1] & [a_2]_1 \\ [a_2] & [a_1]_1 \end{array}$$

The prastara $[a_1, a_2, a_3]$ of three elements is a $6 \times 3$ array, which (in our 
notation) has the form 

$$\begin{array}{ll} [a_1, a_2] & [a_3]_2 \\ [a_1, a_3] & [a_2]_2 \\ [a_2, a_3] & [a_1]_2 \end{array}$$

Having obtained the prastara of three elements $[a_1, a_2, a_3]$, we can see that the 
prastara of four elements $[a_1, a_2, a_3, a_4]$, explicitly shown in the preceding Table 
1, has the structure 

$$\begin{array}{ll} [a_1, a_2, a_3] & [a_4]_6 \\ [a_1, a_2, a_4] & [a_3]_6 \\ [a_1, a_3, a_4] & [a_2]_6 \\ [a_2, a_3, a_4] & [a_1]_6 \end{array}$$


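By induction, the prastara $[a_1, a_2, \ldots, a_n]$ of $n$ elements can be shown to be the 
$n! \times n$ array given by 

$$\begin{array}{ll} [a_1, a_2, \ldots, a_{n-2}, a_{n-1}] & [a_n]_{(n-1)!} \\ [a_1, a_2, \ldots, a_{n-2}, a_n] & [a_{n-1}]_{(n-1)!} \\ \qquad\vdots & \quad\vdots \\ [a_2, a_3, \ldots, a_{n-1}, a_n] & [a_1]_{(n-1)!} \end{array}$$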


Here, we first note that the $n$-th element changes for the first time (from $a_n$ to 
$a_{n-1}$) after $(n-1)!$ rows. The permutation with row-number $(n-1)!$ has the form 
$(a_{n-1}, a_{n-2}, \ldots, a_1, a_n)$, in which the first $n-1$ elements appear in the reverse order, 
followed by $a_n$. In this permutation, only the last two elements are in the ascending 
order and, following Sarngadeva's rule for prastara, the succeeding row (with 
row-number $(n-1)! + 1$) is constructed by placing the largest of the elements to the 
left of $a_n$ (in this case $a_{n-1}$) below $a_n$ and placing the rest of the elements in the 
ascending order at the left. This is precisely the permutation 
$(a_1, a_2, \ldots, a_{n-2}, a_n, a_{n-1})$ which 
appears in the above general form at the row numbered $(n-1)! + 1$. By induction, 
it can be seen that the array given above is indeed the prastara of $n$ elements 
according to the construction of Sarngadeva. 
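
The block structure described above translates directly into a recursive construction. The sketch below is only an illustration under the assumption that the elements are supplied in ascending order; the name `prastara_inductive` is hypothetical.

```python
def prastara_inductive(elements):
    """Build the prastara of `elements` (ascending order) block by block."""
    if len(elements) <= 1:
        return [list(elements)]
    rows = []
    # The last column takes a_n, a_(n-1), ..., a_1 in turn; each block is the
    # prastara of the remaining n - 1 elements with that element appended.
    for last in reversed(range(len(elements))):
        rest = elements[:last] + elements[last + 1:]
        rows.extend(r + [elements[last]] for r in prastara_inductive(rest))
    return rows


for number, row in enumerate(prastara_inductive("SRGM"), start=1):
    print(number, "".join(row))
```

Each of the n blocks has (n - 1)! rows, so the recursion produces n! rows in total.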

We may also note that the above prastara enumerates all the permutations 
of $n$ distinct elements in an order which is different from the so-called 
lexicographic order, which is perhaps more commonly used in modern combinatorics. 
Sarngadeva's rule actually enumerates the permutations in what is sometimes 
referred to as the colex order, the mirror image of lexicographic order in the 
reverse. 


2.2. A "factorial representation" of integers 


After discussing the prastara or the enumeration process, Sarngadeva discusses 
the pratyayas, uddista and nasta. Uddista is the process by which, given a tana 
or permutation of the svaras, one finds the number of the row in which it appears 
in the prastara (or enumeration), without going through the process of generating 
the entire prastara. Nasta is the converse process by which, given the number of 
the row, the corresponding tana or the permutation is determined. The processes 
of uddista and nasta are discussed by Sarngadeva in terms of what is called 
the khanda-meru, and are essentially based upon a certain representation of any 
integer uniquely as a sum of factorials. 

In this section we shall outline some mathematical aspects of this representa- 
tion, which is a variant of what is widely known as the factorial representation of 
integers. We shall then consider how Sarngadeva's construction of prastara and 
the uddista and nasta processes are encoded in this representation. We begin with 
the following: 

Proposition 1: Every integer $1 \le m \le n!$ can be uniquely represented in the 
form 

$$m = d_0\,0! + d_1\,1! + d_2\,2! + \cdots + d_{n-1}\,(n-1)!, \tag{1}$$

where $d_i$ are integers such that $d_0 = 1$ and $0 \le d_i \le i$, for $i = 1, 2, \ldots, n-1$. 

Proof: We prove the existence of such a representation by actually giving an 
explicit procedure by which every integer $1 \le m \le n!$ can be represented in the 
form (1) given above. First, $d_{n-1}$ is obtained as the quotient in the division of $m$ 
by $(n-1)!$, where we stipulate further that, if $m$ is divisible by $(n-1)!$, then the 
quotient is reduced by 1 and $(n-1)!$ is taken as the remainder. That is, $d_{n-1}$ and 
$r_{n-1}$ are given uniquely by 

$$m = d_{n-1}\,(n-1)! + r_{n-1}, \tag{2}$$

where $0 \le d_{n-1} \le n-1$ and $0 < r_{n-1} \le (n-1)!$. Next, we find $d_{n-2}$ and 
$r_{n-2}$ similarly by dividing $r_{n-1}$ by $(n-2)!$. And so on. When we finally reach $r_2$, 
since $0 < r_2 \le 2$, we set $d_1 = 1$ if $r_2 = 2$ and $d_1 = 0$ if $r_2 = 1$. Then, it follows 
that $d_0 = 1$, and we have thus a systematic procedure by which every integer 
$1 \le m \le n!$ can be represented in the form (1) given above. In particular, for any 
$r \le n$ we have a fascinating relation (with $d_0 = 1$, $d_i = i$, for all $1 \le i \le r-1$), 

$$r! = 1 \cdot 0! + 1 \cdot 1! + 2 \cdot 2! + \cdots + (r-1)\,(r-1)! \tag{3}$$

The uniqueness of the representation (1) can be shown by induction on $m$. We 
note that since the uniqueness is clear for $m = 1, 2$, we may certainly start the 
induction. Assume that $m > 2$ and let 

$$m = 1 \cdot 0! + d_1 1! + d_2 2! + \cdots + d_{n-1}(n-1)! = 1 \cdot 0! + d'_1 1! + d'_2 2! + \cdots + d'_{n-1}(n-1)!$$

If $d_{n-1} = d'_{n-1} \neq 0$, then 

$$m - d_{n-1}(n-1)! = 1 \cdot 0! + d_1 1! + d_2 2! + \cdots + d_{n-2}(n-2)! = 1 \cdot 0! + d'_1 1! + d'_2 2! + \cdots + d'_{n-2}(n-2)!,$$

so that (by induction on $m$) it follows that $d_i = d'_i$ for $1 \le i \le (n-2)$ also. (If 
$d_{n-1} = d'_{n-1} = 0$, the last term may simply be dropped from both sides and the 
argument repeated with $n-1$ in place of $n$.) If 
$d_{n-1} \neq d'_{n-1}$, then we can assume without loss of generality that $d_{n-1} > d'_{n-1}$, 
and we have 

$$m - d'_{n-1}(n-1)! = 1 \cdot 0! + d_1 1! + d_2 2! + \cdots + d_{n-2}(n-2)! + (d_{n-1} - d'_{n-1})(n-1)! = 1 \cdot 0! + d'_1 1! + d'_2 2! + \cdots + d'_{n-2}(n-2)!$$

If $d'_{n-1} > 0$, then $m - d'_{n-1}(n-1)! < m$, and we are through by induction. On the 
other hand, if $d'_{n-1} = 0$, then since $d_{n-1} > 0$ the first of the above two equations 
yields $m > (n-1)!$, while the second (in view of (3)) yields $m \le (n-1)!$, a 
contradiction. This proves the proposition. 

Remark: The above representation (1) is in fact a variant of what is generally 
referred to as the factorial representation of integers, where every integer 
$1 \le m \le n!$ has a unique representation of the form 

$$m = f_1 1! + f_2 2! + \cdots + f_{n-1}(n-1)! + f_n n!, \tag{4}$$

with $0 \le f_i \le i$, for $i = 1, 2, \ldots, n$. If $m = n!$, we have $f_i = 0$, for $i < n$ 
and $f_n = 1$; in other words, the above representation (4) becomes trivial when 
$m = n!$. On the other hand, the factorial representation (1), given by the above 
proposition, reduces to the following relation (with $d_i = i$, for all $i$), 

$$n! = 1 \cdot 0! + 1 \cdot 1! + 2 \cdot 2! + \cdots + (n-1)\,(n-1)!,$$

which has been mentioned already. 

Corollary: For any integer $1 \le m \le n!$, the assignment $m \mapsto (d_1, d_2, \ldots, d_{n-1})$, 
given by equation (1), is a bijection of the set of natural numbers $\le n!$ with the set 
of all $(n-1)$-tuples of integers $\{(d_1, d_2, \ldots, d_{n-1}) \mid 0 \le d_i \le i,\ i = 1, 2, \ldots, n-1\}$. 
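
The division procedure used in the existence proof can be sketched as follows. This is a minimal sketch; the function name `factorial_digits` is an assumption made for illustration, and the list it returns is ordered as $d_0, d_1, \ldots, d_{n-1}$.

```python
from math import factorial


def factorial_digits(m, n):
    """Return [d_0, ..., d_(n-1)] with m = sum(d_i * i!), d_0 = 1, 0 <= d_i <= i."""
    if not 1 <= m <= factorial(n):
        raise ValueError("m must satisfy 1 <= m <= n!")
    digits = [0] * n
    r = m
    for i in range(n - 1, 0, -1):
        q, rem = divmod(r, factorial(i))
        if rem == 0:
            # Exact division: reduce the quotient by 1 and keep i! as the remainder.
            q, rem = q - 1, factorial(i)
        digits[i], r = q, rem
    digits[0] = r  # the remainder left at the end is always 1
    return digits


print(factorial_digits(15, 4))  # coefficients for m = 15 when n = 4
print(factorial_digits(24, 4))  # coefficients for m = n! = 24
```

Running the function over all m from 1 to n! and collecting the results gives a quick numerical check of the bijection stated in the Corollary.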


2.3. Uddista and nasta processes and the factorial representation 


We shall use the factorial representation discussed above to understand the 
processes of uddista and nasta as also the method of construction of the prastara. 
We shall first show that there is a one to one correspondence between the set 
$\{(a_{\sigma(1)}, a_{\sigma(2)}, \ldots, a_{\sigma(n)})\}$ of all permutations of $n$ distinct symbols and the coefficients 
$\{(d_0 = 1, d_1, d_2, \ldots, d_{n-1}) \mid 0 \le d_i \le i,\ i = 1, 2, \ldots, n-1\}$ which arise in the 
factorial representation of numbers $1 \le m \le n!$. As an illustration of this 
correspondence, we present in Table 2 the prastara $[a_1, a_2, a_3, a_4]$ of four distinct 
elements along with the coefficients $d_0 = 1, d_1, d_2, d_3$ of the factorial 
representation for each row-number $m$ for $1 \le m \le 24$. 

Given any permutation $(a_{\sigma(1)}, a_{\sigma(2)}, \ldots, a_{\sigma(n)})$, the corresponding set of 
coefficients $d_1, d_2, \ldots, d_{n-1}$ is determined sequentially, starting first with $d_{n-1}$, 
by carefully following the inductive process that we described for generating 
the prastara. As we mentioned earlier, the prastara for $n$ elements 
$[a_1, a_2, \ldots, a_{n-1}, a_n]$ can be written down using the prastara for $n-1$ elements in 
the following manner: 

$$\begin{array}{ll} [a_1, a_2, \ldots, a_{n-2}, a_{n-1}] & [a_n]_{(n-1)!} \\ [a_1, a_2, \ldots, a_{n-2}, a_n] & [a_{n-1}]_{(n-1)!} \\ \qquad\vdots & \quad\vdots \\ [a_2, a_3, \ldots, a_{n-1}, a_n] & [a_1]_{(n-1)!} \end{array}$$

Each of the blocks in the above prastara has $(n-1)!$ rows. Thus, the last element 
in the first $(n-1)!$ rows is $a_n$ and this corresponds to $d_{n-1} = 0$ in the factorial 
representation of the corresponding row-number. In the next $(n-1)!$ rows, the 
last element is $a_{n-1}$ and this corresponds to $d_{n-1} = 1$. And so on. Thus we 
see that if the last element of the given row is $a_{\sigma(n)}$, where $\sigma$ is a permutation of 
$1, 2, \ldots, n$, then the corresponding row-number will have a factorial representation 
with $d_{n-1} = n - \sigma(n)$. Then, we remove the element $a_{\sigma(n)}$ and consider the 
prastara of the remaining $n-1$ elements and determine the coefficient $d_{n-2}$ in 
the same way as we found $d_{n-1}$, by considering the last element $a_{\sigma(n-1)}$ of the 
permutation $(a_{\sigma(1)}, a_{\sigma(2)}, \ldots, a_{\sigma(n-1)})$. For this purpose, let $\sigma(n-1)$ be the $j$-th 
element when we arrange $\sigma(1), \sigma(2), \ldots, \sigma(n-1)$ in the increasing order. Then, 
$d_{n-2} = n - 1 - j$. Again, let $\sigma(n-2)$ be the $k$-th element when we arrange 
$\sigma(1), \sigma(2), \ldots, \sigma(n-2)$ in the increasing order. Then, $d_{n-3} = n - 2 - k$. We 
continue this procedure, obtaining $d_i$ down to $i = 1$, and then set $d_0 = 1$. 


  m    Permutation    d0 d1 d2 d3
  1    a1 a2 a3 a4     1  0  0  0
  2    a2 a1 a3 a4     1  1  0  0
  3    a1 a3 a2 a4     1  0  1  0
  4    a3 a1 a2 a4     1  1  1  0
  5    a2 a3 a1 a4     1  0  2  0
  6    a3 a2 a1 a4     1  1  2  0
  7    a1 a2 a4 a3     1  0  0  1
  8    a2 a1 a4 a3     1  1  0  1
  9    a1 a4 a2 a3     1  0  1  1
 10    a4 a1 a2 a3     1  1  1  1
 11    a2 a4 a1 a3     1  0  2  1
 12    a4 a2 a1 a3     1  1  2  1
 13    a1 a3 a4 a2     1  0  0  2
 14    a3 a1 a4 a2     1  1  0  2
 15    a1 a4 a3 a2     1  0  1  2
 16    a4 a1 a3 a2     1  1  1  2
 17    a3 a4 a1 a2     1  0  2  2
 18    a4 a3 a1 a2     1  1  2  2
 19    a2 a3 a4 a1     1  0  0  3
 20    a3 a2 a4 a1     1  1  0  3
 21    a2 a4 a3 a1     1  0  1  3
 22    a4 a2 a3 a1     1  1  1  3
 23    a3 a4 a2 a1     1  0  2  3
 24    a4 a3 a2 a1     1  1  2  3


TABLE 2. Tana-prastara and Factorial Representation. 


Having determined the coefficients $d_0, d_1, \ldots, d_{n-1}$ associated with the given 
permutation $(a_{\sigma(1)}, a_{\sigma(2)}, \ldots, a_{\sigma(n)})$, the corresponding row-number $m$ in the 
prastara is given by 

$$m = d_0\,0! + d_1\,1! + d_2\,2! + \cdots + d_{n-1}\,(n-1)!$$

This procedure for determining the row number of a given permutation is called 
(as in prosody) uddista. Sarngadeva follows a similar method for the uddista 
process which we shall describe later. 

As an example of the above procedure, we may, for instance, compute the row 
number of the permutation $(a_3, a_2, a_4, a_1)$ in the prastara $[a_1, a_2, a_3, a_4]$. Here 
$\sigma(1) = 3$, $\sigma(2) = 2$, $\sigma(3) = 4$ and $\sigma(4) = 1$. According to the procedure 
outlined above, $d_3 = 4 - \sigma(4) = 4 - 1 = 3$. The element $a_{\sigma(3)} = a_4$ occurs as 
the third element of $a_2, a_3, a_4$ and hence $d_2 = 3 - 3 = 0$. The element $a_{\sigma(2)} = a_2$ 
occurs as the first element of $a_2, a_3$ so that $d_1 = 2 - 1 = 1$. Hence the row number 
of $(a_3, a_2, a_4, a_1)$ is $1 \cdot 0! + 1 \cdot 1! + 0 \cdot 2! + 3 \cdot 3! = 20$. 
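
The uddista computation in the example above can be sketched in a few lines. The sketch assumes a permutation is passed as the tuple of indices $(\sigma(1), \ldots, \sigma(n))$; the function name `uddista` is chosen here for illustration only.

```python
from math import factorial


def uddista(perm):
    """Row number of `perm`, given as the indices (sigma(1), ..., sigma(n))."""
    remaining = list(perm)
    m = 1  # the fixed term d_0 * 0!
    for i in range(len(perm) - 1, 0, -1):
        last = remaining.pop()
        # 1-based position of `last` among the elements still present, itself included.
        j = 1 + sum(1 for x in remaining if x < last)
        m += (i + 1 - j) * factorial(i)  # d_i = (i + 1) - j
    return m


print(uddista((3, 2, 4, 1)))  # the permutation (a3, a2, a4, a1) worked out above
```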

We now indicate the converse process of nasta (or finding the "lost" row), 
which gives a procedure to find, for any integer $1 \le m \le n!$, the $m$-th row of the 
prastara $[a_1, a_2, \ldots, a_n]$. We begin with the factorial representation of $m$: 

$$m = d_0\,0! + d_1\,1! + d_2\,2! + \cdots + d_{n-1}\,(n-1)!$$

We set $\sigma(n) = n - d_{n-1}$ and take $a_{\sigma(n)}$ as the last element of the row. We 
remove $a_{\sigma(n)}$ from the set $a_1, a_2, \ldots, a_n$ and arrange the rest of the elements in the 
ascending order. If $j = n - 1 - d_{n-2}$, then we let the $j$-th element of this 
sequence be the penultimate element $a_{\sigma(n-1)}$ of the $m$-th row. We next omit $a_{\sigma(n)}$ 
and $a_{\sigma(n-1)}$ and arrange the rest of the elements of the set $a_1, a_2, \ldots, a_n$ in the 
ascending order. If $k = n - 2 - d_{n-3}$, then the $k$-th element in this sequence 
will be the next element $a_{\sigma(n-2)}$ in the $m$-th row. We continue this procedure till 
we use up all $d_i$, $1 \le i \le (n-1)$, and define $a_{\sigma(1)}$ to be the remaining 
symbol left in $a_1, a_2, \ldots, a_n$. Thus the permutation in the $m$-th row is determined to 
be $(a_{\sigma(1)}, a_{\sigma(2)}, \ldots, a_{\sigma(n)})$. This is the nasta process. Sarngadeva also follows 
essentially the same method as we shall see later. 

To illustrate the above procedure, we shall find the 15th row of the prastara 
$[a_1, a_2, a_3, a_4]$. We have $15 = 1 \cdot 0! + 0 \cdot 1! + 1 \cdot 2! + 2 \cdot 3!$. The rule above shows that 
$\sigma(4) = 4 - 2 = 2$. The last element in the row is thus $a_2$. The remaining elements 
in the increasing order of the indices are $a_1, a_3, a_4$. Since $j = 3 - d_2 = 3 - 1 = 2$, we 
take the second element amongst them, namely $a_3$, as the penultimate element of 
the row. Now we omit $a_3$ also and are left with $a_1, a_4$. Since $k = 2 - d_1 = 2 - 0 = 2$, 
the next element of the row will be $a_4$. The remaining symbol is $a_1$. Thus the 15th 
row of the prastara is determined to be $(a_1, a_4, a_3, a_2)$. 
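
The nasta procedure can be sketched similarly. In this sketch the coefficients are obtained by first subtracting the fixed term $d_0 \cdot 0! = 1$ and then dividing in the ordinary way, which yields the same $d_i$ as the stipulated division of Proposition 1; the function name `nasta` and the use of index tuples are assumptions made for illustration.

```python
from math import factorial


def nasta(m, n):
    """Return the m-th row of the prastara of a_1..a_n as a tuple of indices."""
    if not 1 <= m <= factorial(n):
        raise ValueError("m must satisfy 1 <= m <= n!")
    remaining = list(range(1, n + 1))  # indices still unused, in ascending order
    r = m - 1                          # remove the fixed term d_0 * 0! = 1
    row = []
    for i in range(n - 1, 0, -1):
        d, r = divmod(r, factorial(i))     # d is the coefficient d_i
        # The (i + 1 - d)-th remaining element, i.e. 0-based index i - d.
        row.append(remaining.pop(i - d))
    row.append(remaining.pop())            # the one symbol left becomes the first element
    return tuple(reversed(row))


print(nasta(15, 4))  # indices of the 15th row of the prastara of four elements
```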

The factorial representation also sheds considerable light on the method of 
constructing the prastara. To see this, we take one more look at the way the prastara 
of $n$ elements is constructed from that of $n-1$ elements. 

$$\begin{array}{ll} [a_1, a_2, \ldots, a_{n-2}, a_{n-1}] & [a_n]_{(n-1)!} \\ [a_1, a_2, \ldots, a_{n-2}, a_n] & [a_{n-1}]_{(n-1)!} \\ \qquad\vdots & \quad\vdots \\ [a_2, a_3, \ldots, a_{n-1}, a_n] & [a_1]_{(n-1)!} \end{array}$$

Here, we note that the $n$-th element changes for the first time (from $a_n$ to $a_{n-1}$) 
after $(n-1)!$ rows. And, at the same time, the coefficient $d_{n-1}$ in the factorial 
representation changes from 0 to 1. In other words, for numbers $1 \le m \le (n-1)!$, 
$d_{n-1} = 0$; and we have $d_{n-1} = 1$ for the first time when $m = (n-1)! + 1$. 
Now, the number $(n-1)!$ has the factorial representation 

$$(n-1)! = 1 \cdot 0! + 1 \cdot 1! + 2 \cdot 2! + \cdots + (n-2)\,(n-2)! + 0 \cdot (n-1)!$$

Here, we have (apart from the usual $d_0 = 1$) $d_i = i$ for $1 \le i \le n-2$, and 
$d_{n-1} = 0$. When we go to the next row, the representation is changed to 

$$(n-1)! + 1 = 1 \cdot 0! + 0 \cdot 1! + 0 \cdot 2! + \cdots + 0 \cdot (n-2)! + 1 \cdot (n-1)!$$

That is, in passing from $(n-1)!$ to $(n-1)! + 1$, the coefficient $d_{n-1}$ increases by 
1, and the coefficients $d_i$ change to 0 for $1 \le i \le n-2$. By using induction on $n$, 
we can easily establish the following more general result: 

Proposition 2: Let $m$ be an integer such that $1 \le m \le n!$, with factorial 
representation 

$$m = 1 \cdot 0! + d_1 1! + d_2 2! + \cdots + d_{n-1}(n-1)!,$$

with $0 \le d_i \le i$, for $i = 1, 2, \ldots, n-1$. If $d_k$ ($k \ge 1$) is the first coefficient in the 
above factorial representation such that $d_k < k$, then the factorial representation 
for $(m + 1)$ is given by⁵ 

$$m + 1 = 1 \cdot 0! + d'_1 1! + d'_2 2! + \cdots + d'_{n-1}(n-1)!,$$

where $d'_i = d_i - i = 0$ for $1 \le i < k$, $d'_k = d_k + 1$, and $d'_i = d_i$ for $i > k$. 

This proposition gives the precise relation between the factorial representation 
and the rule of Sarngadeva for generating successive rows in the prastara, 
discussed earlier. 

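
Proposition 2 amounts to a carry rule on the coefficients, which can be sketched as below. The function name `increment` is hypothetical, and the coefficients are assumed to be stored as a list $[d_0, d_1, \ldots, d_{n-1}]$.

```python
def increment(digits):
    """Given [d_0, ..., d_(n-1)] for m, return the coefficients for m + 1."""
    result = list(digits)
    for k in range(1, len(result)):
        if result[k] < k:
            result[k] += 1           # d'_k = d_k + 1
            for i in range(1, k):
                result[i] = 0        # d'_i = 0 for 1 <= i < k
            return result
    raise ValueError("all d_i = i: m = n! is the last row")


# Step from m = 3! = 6 to m = 7 in the case n = 4.
print(increment([1, 1, 2, 0]))
```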
2.4. Uddista and Nasta processes as discussed by Sarngadeva 


2.4.1. Khanda-meru 


We now present the uddista and nasta processes as discussed by Sarngadeva 
and show how they are indeed based on the factorial representation of integers 
discussed above. Sarngadeva bases his discussion of the uddista and nasta 
processes on a tabular figure referred to as the khanda-meru, which is 
essentially a device to read off the coefficients $d_i$ which arise in the factorial 
representation of any given number and vice versa. The khanda-meru is described in the 
following verses: 

[Sanskrit verses from the Sangitaratnakara; the Devanagari text is illegible in the source] 

⁵ If it happens that $d_i = i$ for all $1 \le i \le (n-1)$, then from (3) we have $m = n!$, and we are at the 
end of the prastara. 


The tabular figure khanda-meru consists of seven columns (as we are 
dealing with the seven svaras in tana-prastara) with the number of entries increasing 
from one to seven, from left to right. In the first row, place 1 in the first column 
(from the left) followed by 0's in the other columns. In the second row, starting 
from the second column, place the total number (or samkhya) of tanas of 1, 2, 3, 
etc., svaras. Thus the entries in the second row are the factorials 1, 2, 6, 24, 120 
and 720. In the succeeding rows, place twice, thrice, etc. (that is, successive 
multiples) of the factorials, starting from a later column at each stage. Thus, the first 
column of the meru consists of just $0! = 1$, and the $n$-th column, for $n > 1$, is 
made up of the multiples $i \cdot (n-1)!$, with $0 \le i \le n-1$, as shown in Table 3. 


   1     0     0     0     0     0     0
         1     2     6    24   120   720
               4    12    48   240  1440
                    18    72   360  2160
                          96   480  2880
                               600  3600
                                    4320


TABLE 3. Khanda-meru. 
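
The construction of the khanda-meru described above is mechanical, and a short sketch may help in checking the entries. The function name `khanda_meru` and the column-wise storage are assumptions of this sketch.

```python
from math import factorial


def khanda_meru(n=7):
    """Return the columns of the khanda-meru; column c (1-based) has c entries."""
    columns = [[1]]  # the first column holds only 0! = 1
    for c in range(2, n + 1):
        # Column c holds the multiples i * (c - 1)! for i = 0, 1, ..., c - 1.
        columns.append([i * factorial(c - 1) for i in range(c)])
    return columns


for column in khanda_meru():
    print(column)
```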


Thus, in the tabular figure of khanda-meru, Sarngadeva has a table of mul- 
tiples of all the factorials. He uses this effectively to find out the number of the 
row in a prastara that corresponds to a given tana (the uddista process), and con- 
versely (the nasta process), by precisely making use of the factorial representation 
that we discussed earlier. This will be clear from what follows. 


2.4.2. Uddista Process 


Sarngadeva explains the uddista process as follows: 


[Sanskrit verses from the Sangitaratnakara; the Devanagari text is illegible in the source] 


Given a tana (of $n$ svaras), note the rank of the last svara counting the svaras 
from the end in the natural order (i.e., counting the svaras in the reverse of the 
natural order). Mark the corresponding entry in the last or the $n$-th column of the 
khanda-meru. Note the rank of the next svara (in the reverse of natural order) 
among the remaining svaras. Mark the corresponding entry in the next or the 
$(n-1)$-th column. And so on. The uddista or the rank-number of the given tana 
is the sum of all the marked entries. 


TABLE 4. Khanda-meru for uddista. 


Example: Rank of the tana MSRG. 

• 4th column: G is second of S, R, G, M in the reverse order, so mark the 
second entry 6. 

• 3rd column: R is second among (the remaining) S, R, M in the reverse 
order, so mark 2. 

• 2nd column: S is the second among S, M in the reverse order, so mark 1. 

• 1st column: M is left, and we mark the entry 1. 

Rank-number = 1 + 1 + 2 + 6 = 10. 

Clearly, the process of uddista, as set forth above by Sarngadeva, involves first 
working out the coefficients $d_i$ which correspond to the given tana or 
permutation. Then the khanda-meru is employed to find out the row-number associated 
with these coefficients. In the case of the given tana, MSRG, the coefficients 
turn out to be $d_3 = d_2 = d_1 = 1$. Then the corresponding row-number is found 
from the khanda-meru to be 

$$1 \cdot 0! + 1 \cdot 1! + 1 \cdot 2! + 1 \cdot 3! = 1 + 1 + 2 + 6 = 10.$$
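
The marking procedure on the khanda-meru can be sketched as follows. The sketch assumes svaras are single letters and computes each marked entry directly instead of storing the table; the function name `uddista_meru` is illustrative.

```python
from math import factorial


def uddista_meru(tana, order):
    """Rank of `tana` (e.g. "MSRG") among the permutations of `order` (e.g. "SRGM")."""
    remaining = list(order)  # remaining svaras in the natural (ascending) order
    total = 0
    for col in range(len(tana), 1, -1):          # columns n, n - 1, ..., 2
        svara = tana[col - 1]
        # 1-based rank of the svara counted in the reverse of the natural order.
        reverse_rank = len(remaining) - remaining.index(svara)
        total += (reverse_rank - 1) * factorial(col - 1)  # the marked entry
        remaining.remove(svara)
    return total + 1  # add the single entry 1 of the first column


print(uddista_meru("MSRG", "SRGM"))
```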


2.4.3. Nasta process 


Sarngadeva explains the converse or the nasta process as follows: 


[Sanskrit verses from the Sangitaratnakara; the Devanagari text is illegible in the source] 


To find the tana (of $n$ svaras) corresponding to a given rank-number or 
nasta-samkhya, mark that entry which is the greatest number strictly less than the 
rank-number in the $n$-th column of the khanda-meru. Subtract that entry from the rank 
number and mark the entry, which is the greatest number strictly less than the 
resulting number, in the next or the $(n-1)$-th column. And so on. The position 
of the marked entry in the last column gives the rank of the last svara of the tana 
in the reverse natural order. The position of the marked entry in the next column 
gives the rank of the last but one svara, amongst the remaining svaras, in the 
reverse natural order. And so on. 

Example: To find the 18th tana in the prastara of SRGM. 

• In the fourth column of the khanda-meru, the greatest number less than 18 
is 12, which occurs in the third row. The fourth svara of the tana is thus the 
third among S, R, G, M in reverse order, namely R. 

• Now, 18 - 12 = 6. The greatest number less than this in the third column 
is 4, which occurs in the third row. The third svara of the tana is the third 
among S, G, M in reverse order, S. 

• Now, 6 - 4 = 2. The greatest number less than this in the second column is 
1, occurring in the second row. The second svara of the tana is second among 
G, M in reverse order, G. 

• The first svara of the tana is the one left, M. 

TABLE 5. Khanda-meru for nasta. 

The 18th tana is therefore MGSR. 

Clearly, the process of nasta, as set forth above by Sarngadeva, involves first 
working out the factorial representation of the given number. In the above 
example, the number 18 was decomposed as 

$$18 = 1 + 1 + 4 + 12 = 1 \cdot 0! + 1 \cdot 1! + 2 \cdot 2! + 2 \cdot 3!$$

Here, the khanda-meru is used to straight away read off the coefficients 
$d_0, d_1, d_2, d_3$ which occur in the factorial representation of the given number. 
Once these coefficients are determined, the tana or the permutation can be 
obtained by the standard procedure discussed earlier. 
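
The nasta process with the khanda-meru can be sketched in the same spirit. The sketch below assumes single-letter svaras and rebuilds each column on the fly; the function name `nasta_meru` is an assumption made for illustration.

```python
from math import factorial


def nasta_meru(rank, order):
    """Return the tana with the given rank among the permutations of `order`."""
    n = len(order)
    if not 1 <= rank <= factorial(n):
        raise ValueError("rank must satisfy 1 <= rank <= n!")
    remaining = list(order)  # remaining svaras in the natural (ascending) order
    chosen = []              # svaras of the tana, collected from last to first
    for col in range(n, 1, -1):
        column = [i * factorial(col - 1) for i in range(col)]
        # 0-based position of the greatest entry strictly less than the current number.
        pos = max(i for i, entry in enumerate(column) if entry < rank)
        rank -= column[pos]
        # The (pos + 1)-th svara counted in the reverse of the natural order.
        chosen.append(remaining.pop(len(remaining) - 1 - pos))
    chosen.append(remaining.pop())  # the svara left over is the first of the tana
    return "".join(reversed(chosen))


print(nasta_meru(18, "SRGM"))
```

Because each marked entry is strictly less than the running number, the number left after every subtraction stays at least 1, which accounts for the entry 1 in the first column.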


2.5. Historical Note 


Among the available texts, Sangitaratnakara seems to be the earliest one to 
discuss the subject of tana-prastara. The famous mathematical text Ganitakaumudi 
(c. 1356) of Narayana Pandita has a chapter on Ankapasa. Here, Narayana 
gives the same rule for generating the prastara of the set of all permutations of 
p digits (Ganitakaumudi XIII. 45-55). He also uses the same khanda-meru in 
order to discuss the nasta and uddista processes. As we have also remarked, the 
nasta process is essentially based on a factorial representation of integers. 

Though the text does not mention it explicitly, the rule for prastara given in 
Sangitaratnakara is also applicable for tanas with repetition of svaras. Narayana 
discusses this case explicitly, along with the corresponding nasta and uddista 
processes (Ganitakaumudi XIII. 55-61). 

In the European tradition, an algorithm for permutation generation is said to 
have been first given by C. F. Hindenburg in 1784 (D. E. Knuth: The Art of 
Computer Programming Vol. IV, Fasc IV, History of Combinatorial Generation, 
Addison Wesley, New York 2006, p. 61-2). 


3. Tala-prastara 


Apart from Sangita-samayasara of Parsvadeva (c.1165), which has a brief 
discussion of tala-prastara, Sangitaratnakara is the oldest text which deals with 
tala-prastara. Its treatment of the subject is much more detailed and definitive 
than that found even in several later texts such as Sangitopanisad-saroddhara 
of Sudhakarakalasa (c.1325), Nartana-nirnaya of Pundarikavitthala (c.1560) 
and Sangita-darpana of Catura Damodara (c.1600). 

Chapter V of Sangitaratnakara is the Taladhyaya with 409 verses. Starting 
with the definition of tala, the first 235 verses deal with marga-talas (classical 
rhythm patterns). Verses 236-311 discuss 120 desi-talas (regional rhythm 
patterns). At the end of this discussion, it is noted that there are indeed very many 
such talas and it would not be possible to display all of them. This sets the stage 
for the prastara-prakarana which takes up the remaining nearly 100 verses of the 
Taladhyaya. 


3.1. Talangas 


The talangas, or the basic rhythm components, considered by Sarngadeva are 
four, namely, druta, laghu, guru and pluta, which may be taken to be of duration 
1, 2, 4 and 6 respectively, in druta units. Thus the tala-prastara considered in 
Sangitaratnakara is sometimes referred to as caturanga-tala-prastara.⁶ 

This process of tala-prastara is a generalization of the process of 
matra-vrtta-prastara discussed in the texts of Chandas-sastra. There, only laghu and guru are 
considered and they are of one and two time units respectively. Ganitakaumudi 
of Narayana Pandita (c.1356) discusses a different kind of generalization of the 
matra-vrtta-prastara of Chandas-sastra, by including units which are also of 
3, 4, ..., q matras in the prastara. The tala-prastara of Sangitaratnakara cannot 
however be subsumed under Narayana's generalization. 


3.2. Pratyayas 


Pratyayas are simple techniques to comprehend the diverse features of the large 
number of talas which are noticed in practice, without having to enumerate them 
explicitly each time. Sarngadeva first lists the nineteen pratyayas considered in 
this Sastra: 

[Sanskrit verses from the Sangitaratnakara, Taladhyaya; the Devanagari text is illegible in the source] 

⁶ In a series of books, the well known musician and musicologist Vidwan Akella Mallikarjuna 
Sarma, formerly Principal, Government College of Music, Hyderabad, has given a detailed exposition 
of the theory of tala-prastara as expounded in Sangitaratnakara. According to Vidwan Sarma, 
there are some later texts of Sangita-sastra (as yet unpublished), which include some more talangas, 
such as anu-druta etc., and different kinds of laghus (khanda-jati, etc.) in their discussion of 
tala-prastaras. It is important to bring these texts to light in order to understand further developments of 
the theory of tala-prastara due to the successors of Sarngadeva. 

The pratyayas which are being discussed by Sarngadeva are the following: 

• Prastara: Systematic generation and enumeration of all possible tala-forms 
with a given total number of time-units. 

• Samkhya: Determination of the number of tala-forms in a prastara. 

• Nasta: Finding the tala-form corresponding to a given rank. 

• Uddista: Ranking of a given tala-form in a prastara. 

• Patala: Total number of drutas, laghus, gurus and plutas in a prastara. 

• Druta-meru, Laghu-meru, Guru-meru and Pluta-meru: Finding the 
number of tala-forms with a given number of drutas, laghus, gurus and plutas, 
in a prastara. 

• Samyoga-meru. 

• Khanda-prastara. 

• Uddista (and nasta): ranking (and un-ranking) a particular tala-form among 
those with a given number of drutas, laghus, gurus or plutas. 


3.3. Prastara-vidhi (Rules for the formation of the Prastara) 

[Sanskrit verses from the Sangitaratnakara, Taladhyaya; the Devanagari text is illegible in the source] 

Sarngadeva's procedure for the construction of prastara is as follows: 

• The last row of the prastara has all drutas only. 

• In the first row, place as many plutas as possible to the right, followed (from 
right to left), if possible, by a guru and a druta or a guru alone; or by a 
laghu and a druta or a laghu alone; or by a druta alone to the left. 

• To go from any row of the prastara to the next, identify the first non-druta 
element from the left. Place below that the element next to it with lower 
duration: Place druta below a laghu, laghu below a guru and guru below a 
pluta. 

• Bring down the elements to the right as they are. 

• Make up for the deficient units (if any) by adding to the left as many plutas 
as possible, followed similarly by gurus, laghus and drutas in that order 
from right to left. 

Example: Sapta-druta-prastara 

We shall use the symbols D, L, G, P to stand for druta, laghu, guru and pluta 
respectively. The traditional symbols for these are somewhat like o, |, S, S. In 
Table 6, we display the sapta-druta-prastara or the enumeration of all rhythmic 
patterns of seven time units following the rule of enumeration of Sarngadeva. We 
also show the time duration of each talanga on the right. 
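
The rules listed above can be sketched in code. This is a minimal sketch assuming the durations 1, 2, 4 and 6 for D, L, G and P stated earlier; the names `fill_left`, `next_tala_row` and `tala_prastara` are hypothetical.

```python
DURATION = {"D": 1, "L": 2, "G": 4, "P": 6}
LOWER = {"L": "D", "G": "L", "P": "G"}  # the element with the next lower duration


def fill_left(units):
    """Fill `units` with as many P, then G, L, D as possible, laid out right to left."""
    parts = []
    for symbol in "PGLD":
        count, units = divmod(units, DURATION[symbol])
        parts.append(symbol * count)
    return "".join(reversed(parts))  # plutas end up rightmost, drutas leftmost


def next_tala_row(row):
    """Return the row following `row`, or None if `row` consists of drutas only."""
    for pos, symbol in enumerate(row):
        if symbol != "D":  # first non-druta element from the left
            tail = LOWER[symbol] + row[pos + 1:]  # lowered element plus the elements to its right
            deficit = sum(DURATION[s] for s in row) - sum(DURATION[s] for s in tail)
            return fill_left(deficit) + tail
    return None


def tala_prastara(n):
    """Generate all rows of the n-druta-prastara, starting from the first row."""
    rows, row = [], fill_left(n)
    while row is not None:
        rows.append(row)
        row = next_tala_row(row)
    return rows


for number, row in enumerate(tala_prastara(7), start=1):
    print(number, row, [DURATION[s] for s in row])
```

The same greedy filling serves both for the first row and for making up the deficient units, since both rules place the longest possible elements to the right.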


3.4. Samkhya 


Sarngadeva next discusses the pratyaya of samkhya, which gives the number of 
rows in the prastara associated with rhythms of a given time measure. 


[Sanskrit verses from the Sangitaratnakara, Taladhyaya; the Devanagari text is illegible in the source] 




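 No.   Tala form          Durations
  1    D P                1 6
  2    D L G              1 2 4
  3    L D G              2 1 4
  4    D D D G            1 1 1 4
  5    D G L              1 4 2
  6    D L L L            1 2 2 2
  7    L D L L            2 1 2 2
  8    D D D L L          1 1 1 2 2
  9    G D L              4 1 2
 10    L L D L            2 2 1 2
 11    D D L D L          1 1 2 1 2
 12    D L D D L          1 2 1 1 2
 13    L D D D L          2 1 1 1 2
 14    D D D D D L        1 1 1 1 1 2
 15    P D                6 1
 16    L G D              2 4 1
 17    D D G D            1 1 4 1
 18    G L D              4 2 1
 19    L L L D            2 2 2 1
 20    D D L L D          1 1 2 2 1
 21    D L D L D          1 2 1 2 1
 22    L D D L D          2 1 1 2 1
 23    D D D D L D        1 1 1 1 2 1
 24    D G D D            1 4 1 1
 25    D L L D D          1 2 2 1 1
 26    L D L D D          2 1 2 1 1
 27    D D D L D D        1 1 1 2 1 1
 28    G D D D            4 1 1 1
 29    L L D D D          2 2 1 1 1
 30    D D L D D D        1 1 2 1 1 1
 31    D L D D D D        1 2 1 1 1 1
 32    L D D D D D        2 1 1 1 1 1
 33    D D D D D D D      1 1 1 1 1 1 1


TABLE 6. Seven-druta-prastara. 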


The sequence of samkhyankas $S_n$, where $S_n$ is the number of tala forms in 
the n-druta-prastara, is obtained as follows: The first two elements of the 
sequence are $S_1 = 1$, $S_2 = 2$. The $n$-th element of the sequence ($S_n$) is obtained by 
adding the previous ($S_{n-1}$), second-previous ($S_{n-2}$), fourth-previous ($S_{n-4}$) and 
the sixth-previous ($S_{n-6}$) elements. That is, the samkhyankas satisfy the 
recurrence relation: 

$$S_n = S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}$$

In case the fourth or sixth-previous elements do not exist, the third and the 
fifth-previous (if they exist) are to be taken as their representatives (pratinidhi). As we 
shall see, this happens only in two cases, namely $S_4$ and $S_6$. 

Sarngadeva also makes the very interesting observation that among all the 
tala-forms which appear in the n-druta-prastara, $S_{n-1}$ end in a druta, $S_{n-2}$ in 
a laghu, $S_{n-4}$ in a guru and $S_{n-6}$ end in a pluta, and the total number of 
tala-forms $S_n$ in the n-druta-prastara is just the sum of these four numbers. We shall 
subsequently discuss the significance of this observation. 


  n     1    2    3    4    5    6    7    8    9   10
  S_n   1    2    3    6   10   19   33   60  106  191


TABLE 7. Sequence of samkhyankas. 
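
The recurrence for the samkhyankas can be checked numerically. The sketch below is one possible implementation and makes an assumption of its own: missing earlier entries are handled by a starting value $S_0 = 1$ and by ignoring terms with negative index, which reproduces the effect of the representatives (pratinidhi). The function name `samkhyankas` is illustrative.

```python
def samkhyankas(count):
    """Return [S_1, ..., S_count] for S_n = S_(n-1) + S_(n-2) + S_(n-4) + S_(n-6)."""
    s = [1]  # s[0] stands for S_0 = 1, a convenience entry assumed by this sketch
    for n in range(1, count + 1):
        # Terms whose index would be negative are simply left out.
        s.append(sum(s[n - step] for step in (1, 2, 4, 6) if n - step >= 0))
    return s[1:]


print(samkhyankas(10))
```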


In Table 7, we give the sequence of samkhyankas $S_n$. Here, $S_1 = 1$, $S_2 = 2$, 
$S_3 = 2 + 1 = 3$, $S_4 = 3 + 2 + 1 = 6$, $S_5 = 6 + 3 + 1 = 10$, $S_6 = 10 + 6 + 2 + 1 = 19$, 
$S_7 = 19 + 10 + 3 + 1 = 33$, $S_8 = 33 + 19 + 6 + 2 = 60$, and so on. The number 
1 which appears in the sum for obtaining $S_4$ and $S_6$ is the third-previous and 
the fifth-previous samkhyanka respectively, and is used here as the representative 
(pratinidhi) of the fourth-previous and the sixth-previous samkhyankas, as they 
are absent. It is sometimes suggested that we can get rid of this prescription to 
use the representatives, by introducing an entry $S_0 = 1$ before the sequence of 
samkhyankas.⁷ The recursion relation $S_n = S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}$ for the 
samkhyankas is a generalization of the recursion relation 

$$s_n = s_{n-1} + s_{n-2}$$

that defines the samkhyankas $s_n$ in a matra-vrtta-prastara. The latter relation 
was noted by Virahanka (c. 650) and gives rise to the so-called Fibonacci 
sequence or, more appropriately, the Virahanka-Fibonacci sequence of numbers.⁸ 
Indeed the mathematical theory of tala-prastara turns out to be a very interesting 
generalization of the mathematical theory of the matra-vrtta-prastara as we shall 
see below. 

⁷ See, for instance, the works of Akella Mallikarjuna Sarma cited in the bibliography. 
⁸ In Ganitakaumudi (c.1356), Narayana Pandita discusses a different kind of generalization 
of the Virahanka-Fibonacci sequence by considering the recurrence relation 
$S_n = S_{n-1} + S_{n-2} + \cdots + S_{n-q}$ to generate what he calls the samasiki pankti of order $q$. 


TABLE 8. Four-druta-prastara. 


3.5. An inductive construction of the Tala-prastara and the Samkhya rule 


The construction process of tala-prastara is indeed a generalization of the con- 
struction of matra-vrtta-prastara as described in texts of Sanskrit prosody. Whilst 
laghu (l) and guru (g) are units of value 1 and 2 respectively in prosody, in the 
theory of musical rhythms there are four units druta (D), laghu (L), guru (G) 
and pluta (P) which have values 1, 2, 4 and 6 respectively. 

In prosody, a row of a prastara, of a given value n, consists of a sequence
of laghus (l) and gurus (g) such that the total value of the row is n, where the
total value is by definition the sum of all the values of the symbols in the row.
The prastara is actually an enumeration of all sequences of l and g by a certain
rule, such that each row has value n, and all possible rows of value n are included 
under the rule of enumeration. The construction of the prastara is achieved by an 
inductive process. 

A similar process is at work in the case of tala-prastara. The prastara of
value 1 is just D. So the corresponding samkhya (the number of tala-patterns
corresponding to a given value) is $S_1 = 1$. The prastara of value 2 consists of
two rows, L and DD, and correspondingly $S_2 = 2$. The prastara of value 3 consists
of three rows, DL, LD and DDD, and correspondingly $S_3 = 3$. The prastara of
value 4 has six rows as shown in Table 8, and correspondingly $S_4 = 6$.

The prastara of value 5 has ten rows as shown in Table 9, and correspondingly
$S_5 = 10$. The prastara of value 6 has nineteen rows as shown in Table 10.
Thus $S_6 = 19$. In each of the above cases, we have an enumeration of all rows of
value 1, 2, 3, 4, 5 and 6.

TABLE 10. Six-druta-prastara.

It is now clear that the tala-prastara of value n can be constructed in the
following manner. We first take the prastara consisting of rows of value $n - 6$
and augment it by adding a pluta (P) at the end of each row. Below that, place the
prastara of value $n - 4$ and augment it by placing a guru (G) at the end. Below
that, place the prastara of value $n - 2$ and augment it by placing a laghu (L) at
the end. Below that, finally, place the prastara of value $n - 1$ and augment it by
placing a druta (D) at the end. We can see the above process clearly at work by
noting that the prastara of value 7, given in Table 6, is constructed in the manner
described above from prastaras of values 1, 3, 5 and 6.
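The inductive construction just described can be sketched in a few lines. This is only an illustration: the name `tala_prastara` is hypothetical, and treating the prastara of value 0 as a single empty row is an assumption made so that the rows P, G, L and D themselves are produced.

```python
from functools import lru_cache

# Units in the order in which their blocks are stacked, top to bottom.
UNITS = (("P", 6), ("G", 4), ("L", 2), ("D", 1))


@lru_cache(maxsize=None)
def tala_prastara(n):
    """Rows of the n-druta-prastara, listed from the top row downwards."""
    if n < 0:
        return ()
    if n == 0:
        return ("",)  # assumption: one empty row of value 0
    rows = []
    for symbol, value in UNITS:
        # Take the prastara of value n - value and append the unit to each row.
        rows.extend(row + symbol for row in tala_prastara(n - value))
    return tuple(rows)


print(tala_prastara(3))                             # ('DL', 'LD', 'DDD')
print([len(tala_prastara(n)) for n in range(1, 8)])  # [1, 2, 3, 6, 10, 19, 33]
```

The row counts reproduce the samkhyankas, and indexing the tuple (with the top row at index 0) gives the tala-form at a given row number.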
3.6. The generating function for Samkhyankas $S_n$


We first explain how, in the case of matra-vrtta-prastara in prosody, the sequence
of samkhyas (the number of moric metres of a given value) may be obtained. For
example, let us consider the row lgg which occurs in the prastara of rows of
value 5. We associate with it a term in the expansion of $(x + x^2)^3$, namely the
term $x^5$ which is obtained by taking $x$ from the first factor and $x^2$ from the
second and third factors and multiplying the monomials so chosen. To consider
another example, to the row glll we associate a term in the expansion of
$(x + x^2)^4$, namely the term $x^5$ which is obtained similarly by taking $x^2$
from the first factor and $x$ from the second, third and fourth factors. From such
examples it is clear that if $s_n$ is the number of rows in the
matra-vrtta-prastara of value n, then the generating function of $s_n$ is given by

$$\sum_{n=0}^{\infty} s_n x^n = 1 + (x + x^2) + (x + x^2)^2 + \cdots = [1 - (x + x^2)]^{-1}.$$

It turns out that the number $s_n$ is the so-called n-th Fibonacci number, or what
may more appropriately be called the n-th Virahanka-Fibonacci number.

It follows in a similar manner that the generating function for the samkhyankas
$S_n$ (the number of rows in the tala-prastara of value n) is given by

$$\sum_{n=0}^{\infty} S_n x^n = 1 + (x + x^2 + x^4 + x^6) + (x + x^2 + x^4 + x^6)^2 + \cdots = [1 - (x + x^2 + x^4 + x^6)]^{-1}. \tag{5}$$

Therefore, we have

$$[1 - (x + x^2 + x^4 + x^6)] \sum_{n=0}^{\infty} S_n x^n = 1.$$

Equating coefficients of the low powers of x, we get $S_1 = S_0 = 1$,
$S_2 = S_1 + S_0 = 2$, $S_3 = S_2 + S_1 = 3$, $S_4 = S_3 + S_2 + S_0 = 6$, etc.
Further, equating coefficients of $x^n$ for $n \geq 6$, we get

$$S_n = S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6} \quad \text{for } n \geq 6. \tag{6}$$


We may refer to the samkhyankas $S_n$ as the Sarngadeva numbers. These numbers
satisfy the recurrence relation (6), which is more complex than the simple
recurrence relation

$$s_n = s_{n-1} + s_{n-2} \tag{7}$$

that characterizes the Virahanka-Fibonacci numbers.
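As a rough numerical check of the two generating functions, one can add up truncated powers of $x + x^2$ and of $x + x^2 + x^4 + x^6$ and read off the coefficients. The helper names below are hypothetical; the sketch uses plain integer lists as truncated power series rather than any library.

```python
def poly_mul(a, b, limit):
    """Product of two coefficient lists, truncated after the power x**limit."""
    out = [0] * (limit + 1)
    for i, ai in enumerate(a):
        if ai == 0:
            continue
        for j, bj in enumerate(b):
            if i + j > limit:
                break
            out[i + j] += ai * bj
    return out


def geometric_series_coefficients(exponents, limit):
    """Coefficients of 1 + p + p**2 + ... up to x**limit, where p = sum of x**e."""
    base = [0] * (limit + 1)
    for e in exponents:
        if e <= limit:
            base[e] = 1
    total = [0] * (limit + 1)
    power = [1] + [0] * limit  # p**0
    for _ in range(limit + 1):  # p**k has no terms below x**k
        total = [t + c for t, c in zip(total, power)]
        power = poly_mul(power, base, limit)
    return total


print(geometric_series_coefficients((1, 2), 8))        # [1, 1, 2, 3, 5, 8, 13, 21, 34]
print(geometric_series_coefficients((1, 2, 4, 6), 8))  # [1, 1, 2, 3, 6, 10, 19, 33, 60]
```

The first list gives the Virahanka-Fibonacci numbers and the second the Sarngadeva numbers, each starting with the coefficient of $x^0$.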


3.7. The decomposition of an integer in terms of Sarngadeva numbers $S_n$


In this section we shall exhibit a procedure to decompose an integer $m \geq 1$ in
terms of the Sarngadeva numbers $S_n$ using induction. The motivation for this
comes from the following result for the Fibonacci numbers, which was demonstrated
in the mathematical theory of matra-vrtta-prastara.[9]

Proposition: Any positive integer is either a Fibonacci number or can be ex- 
pressed uniquely as a sum of non-consecutive Fibonacci numbers. 

In what follows, we shall prove an analogue of this result for Sarngadeva numbers.
If $m = 1$, we write $1 = S_1$. Then we write $2 = S_2$, $3 = S_3$, $4 = S_1 + S_3$.
Now, for any general m, choose the largest integer q such that $S_q \leq m$. By
induction, following the same procedure, we can decompose $m - S_q$, that is, write
it in the form

$$m - S_q = S_{i_1} + S_{i_2} + \cdots + S_{i_p} \quad \text{with } i_1 \leq i_2 \leq \cdots \leq i_p \leq q.$$

Then,

$$m = S_{i_1} + S_{i_2} + \cdots + S_{i_p} + S_q \quad \text{with } i_1 \leq i_2 \leq \cdots \leq i_p \leq q \tag{8}$$

is a decomposition of m which we shall refer to as the canonical decomposition.
Here, for any m, with $1 \leq k \leq p$, $i_k$ is the largest integer such that
$S_{i_k} \leq m - (S_{i_{k+1}} + S_{i_{k+2}} + \cdots + S_{i_p} + S_q)$. We may also
note that if m is itself a Sarngadeva number $S_q$, then $m = S_q$ is the canonical
decomposition of m. We shall now show that the canonical decomposition has certain
properties. For this we need the following lemma.
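The canonical decomposition is a greedy procedure and can be sketched as follows. The function name is hypothetical; the Sarngadeva numbers are generated from the recurrence (6) with the convention $S_0 = 1$, and only $S_1, S_2, \ldots$ are used as summands.

```python
def canonical_decomposition(m):
    """Indices i_1 < i_2 < ... of the greedy decomposition of m >= 1."""
    if m < 1:
        raise ValueError("m must be a positive integer")
    s = [1]  # s[0] is S_0 = 1; it only seeds the recurrence
    while s[-1] <= m:
        n = len(s)
        s.append(sum(s[n - back] for back in (1, 2, 4, 6) if n - back >= 0))
    indices = []
    q = len(s) - 1
    while m > 0:
        # Move down to the largest q >= 1 with S_q <= m (S_1 = 1 always fits).
        while s[q] > m:
            q -= 1
        indices.append(q)
        m -= s[q]
    return sorted(indices)


print(canonical_decomposition(4))   # [1, 3]      since 4 = S_1 + S_3
print(canonical_decomposition(26))  # [1, 4, 6]   since 26 = 1 + 6 + 19
print(canonical_decomposition(19))  # [6]         since 19 = S_6
```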

Lemma 1: The Sarngadeva numbers $S_n$ satisfy the following relations:

(i) $2S_n \geq S_{n+1}$, and equality holds only if $n = 1, 3$ (9)
(ii) $S_n + S_{n+1} + S_{n+2} \geq S_{n+3}$, and equality holds only if $n = 1, 3$ (10)
(iii) $S_n + S_{n+1} + S_{n+3} + S_{n+4} \geq S_{n+5}$, and equality holds only if $n = 1$ (11)


Proof: These relations follow directly from the recurrence relation (6). We
have

$$S_n = S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}.$$

We also note that for $n > 1$, $S_n > S_{n-1}$. Hence

$$2S_n = 2S_{n-1} + 2S_{n-2} + 2S_{n-4} + 2S_{n-6} \geq S_n + S_{n-1} + S_{n-3} + S_{n-5} \ \text{(by induction)} = S_{n+1}.$$

This proves (i). Now we prove (ii) by induction. We note that property (ii) holds
if n = 2. Now, we use the recurrence relation (6) to write

$$S_n + S_{n+1} + S_{n+2} = (S_{n-1} + S_n + S_{n+1}) + (S_{n-2} + S_{n-1} + S_n) + (S_{n-4} + S_{n-3} + S_{n-2}) + (S_{n-6} + S_{n-5} + S_{n-4})$$
$$\geq S_{n+2} + S_{n+1} + S_{n-1} + S_{n-3} \ \text{(by induction)} = S_{n+3}.$$

This proves (ii). Property (iii) follows directly from the recurrence relation (6),
as we have

$$S_n + S_{n+1} + S_{n+3} + S_{n+4} \geq S_{n-1} + S_{n+1} + S_{n+3} + S_{n+4} = S_{n+5}.$$

This completes the proof of the lemma.

[9] The mathematical theory of matra-vrtta-prastara is outlined in R. Sridharan, Pratyayas for
Matravrttas and Fibonacci Numbers, Mathematics Teacher, 42, 2006, 120-137.
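The three inequalities of Lemma 1 are easy to test numerically. The sketch below (hypothetical names, recurrence (6) with $S_0 = 1$) checks them for the first few dozen values of n and records where equality occurs; in this computation, equality in (ii) shows up at n = 3 as well as n = 1, since $S_3 + S_4 + S_5 = 3 + 6 + 10 = 19 = S_6$.

```python
def sarngadeva_numbers(upto):
    """List s with s[n] = S_n for 0 <= n <= upto, taking S_0 = 1."""
    s = [1]
    for n in range(1, upto + 1):
        s.append(sum(s[n - back] for back in (1, 2, 4, 6) if n - back >= 0))
    return s


def lemma_equalities(limit):
    """Check relations (i)-(iii) for 1 <= n <= limit; return the equality cases."""
    s = sarngadeva_numbers(limit + 5)
    relations = {
        "i": lambda n: (2 * s[n], s[n + 1]),
        "ii": lambda n: (s[n] + s[n + 1] + s[n + 2], s[n + 3]),
        "iii": lambda n: (s[n] + s[n + 1] + s[n + 3] + s[n + 4], s[n + 5]),
    }
    equal_at = {}
    for name, sides in relations.items():
        equal_at[name] = []
        for n in range(1, limit + 1):
            left, right = sides(n)
            assert left >= right, (name, n)
            if left == right:
                equal_at[name].append(n)
    return equal_at


print(lemma_equalities(40))  # {'i': [1, 3], 'ii': [1, 3], 'iii': [1]}
```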
Now consider some number $m < S_n$ and let

$$m = S_{i_1} + S_{i_2} + \cdots + S_{i_r} \quad \text{with } i_1 < i_2 < \cdots < i_r \tag{12}$$

be the canonical decomposition of the number, where we have taken note of the
fact (which follows from (9) and the construction of the canonical decomposition)
that each Sarngadeva number $S_i$ appears at most once in the canonical
decomposition of a number. We can therefore write the above decomposition in the
form

$$m = \sum_i a_i S_i, \quad \text{with } a_i = 0 \text{ or } 1 \text{ for } 1 \leq i \leq n, \text{ and } a_n = 0. \tag{13}$$

Hence, to each such decomposition (12), we assign a binary sequence
$(a_1, a_2, \ldots, a_{n-1}, 0)$ of zeroes and ones of length n in the following
manner. With the number m, which has the canonical decomposition (12), we
associate the sequence of length n with zeroes at all places except at the
positions $i_1 < i_2 < \cdots < i_r$, where it has ones. From the above lemma, we
can easily see that such a binary sequence of length n (ending with 0), associated
with any number $m < S_n$, has the following properties:


(a) A combination of the type (1, 1, 1), that is three 1s in a succession, cannot 
occur as a part of the above sequence. 
(b) A combination of the type (1, 1, 0, 1, 1) cannot occur as a part of the above
sequence.

(c) A combination of the type (1, 0, 1, 0, 1, 1) cannot occur as a part of the above
sequence [This follows from the recurrence relation (6)].

(d) The first two entries cannot both be 1s in the above sequence [This follows
from the fact that the first three Sarngadeva numbers are related by
$S_3 = S_1 + S_2$].

(e) The sequence cannot begin with the combination (1, 0, 1, 1, 0) [This follows
from the fact that $S_5 = S_1 + S_3 + S_4$].


We now proceed to show in Proposition 3 that the above conditions are sufficient 
to characterise the binary sequences that are associated with the canonical decom- 
position of a number. We first prove a lemma that is needed for the proof of this 
proposition. 

Lemma 2: Let

$$m = S_{i_1} + S_{i_2} + \cdots + S_{i_r} \quad \text{with } i_1 < i_2 < \cdots < i_r$$

be any decomposition of a number into Sarngadeva numbers and assume that the
corresponding binary sequence of 0's and 1's satisfies the conditions (a)-(e). Then

$$S_{i_1} + S_{i_2} + \cdots + S_{i_r} < S_{i_r + 1}. \tag{14}$$

Proof: We shall prove (14) by induction. We note that (14) is equivalent to

$$S_{i_1} + S_{i_2} + \cdots + S_{i_{r-1}} < S_{i_r + 1} - S_{i_r} \tag{15}$$

(which we know is $S_{i_r - 1} + S_{i_r - 3} + S_{i_r - 5}$).

If $i_{r-1} < i_r - 1$, then by induction,

$$S_{i_1} + S_{i_2} + \cdots + S_{i_{r-1}} < S_{i_{r-1} + 1} \leq S_{i_r - 1}$$

and we are through with (15) and hence with (14). Suppose $i_{r-1} = i_r - 1$. Then
we can cancel $S_{i_{r-1}} = S_{i_r - 1}$ from both sides of (15) and we are left to
prove that

$$S_{i_1} + S_{i_2} + \cdots + S_{i_{r-2}} < S_{i_r - 3} + S_{i_r - 5}. \tag{16}$$

If $i_{r-2} < i_r - 3$, then we are through by induction. Otherwise
$i_{r-2} \geq i_r - 3$; then, since $i_{r-2} < i_{r-1}$ and $i_{r-1} = i_r - 1$,
$i_{r-2}$ is either $i_r - 2$ or $i_r - 3$. If $i_{r-2} = i_r - 2$, then since
$i_{r-1} = i_r - 1$, the combination (1, 1, 1) appears in the sequence, which is a
contradiction. Thus, the only possibility is $i_{r-2} = i_r - 3$, in which case we
can cancel $S_{i_{r-2}}$ from both sides of (16) and are left to prove

$$S_{i_1} + S_{i_2} + \cdots + S_{i_{r-3}} < S_{i_r - 5}. \tag{17}$$

Now, $i_{r-3} < i_{r-2} = i_r - 3$. Suppose $i_{r-3} < i_r - 5$; then we are through
by induction as before. Otherwise, we have $i_{r-3} \geq i_r - 5$; then since
$i_{r-3} < i_{r-2} = i_r - 3$, we have that $i_{r-3}$ is either $i_r - 4$ or
$i_r - 5$. If $i_{r-3} = i_r - 4$, then the binary sequence will have the
combination (1, 1, 0, 1, 1), which has been excluded by assumption. On the other
hand, if $i_{r-3} = i_r - 5$, then the binary sequence will have the combination
(1, 0, 1, 0, 1, 1), which has also been excluded by assumption. Hence the lemma is
proved.
Proposition 3: Let

$$m = S_{i_1} + S_{i_2} + \cdots + S_{i_r} \quad \text{with } i_1 < i_2 < \cdots < i_r \tag{18}$$

be any decomposition of a number and assume that the corresponding binary
sequence of 0's and 1's satisfies the conditions (a)-(e). Then the above
decomposition is the canonical decomposition of m. Equivalently, let

$$m = S_{i_1} + S_{i_2} + \cdots + S_{i_r} \quad \text{with } i_1 < i_2 < \cdots < i_r$$
$$\phantom{m} = S_{j_1} + S_{j_2} + \cdots + S_{j_s} \quad \text{with } j_1 < j_2 < \cdots < j_s \tag{19}$$

be any two decompositions of the number m, such that the corresponding binary
sequences satisfy the conditions (a)-(e). Then the two decompositions in (19) are
the same.

Proof: If $i_r = j_s$, then we can cancel $S_{i_r} = S_{j_s}$ from both the
decompositions and use induction to prove the proposition. Hence we can assume
without loss of generality that $i_r < j_s$. Now, from the above lemma it follows
that

$$S_{i_1} + S_{i_2} + \cdots + S_{i_r} < S_{i_r + 1}.$$

But, since $i_r < j_s$, it follows that $S_{i_r + 1} \leq S_{j_s}$. Hence, we get

$$S_{i_1} + S_{i_2} + \cdots + S_{i_r} < S_{j_s},$$

which contradicts the assumption (19). Hence the proposition is proved.

Corollary: The above propositions establish a bijection between the set of
non-negative integers $0, 1, 2, \ldots, S_n - 1$, which label the rows in an
n-druta-prastara, and binary sequences of length n (ending with 0), which satisfy
the conditions (a)-(e) stated above.
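For small n the bijection of the Corollary can be checked by brute force. The following sketch (hypothetical names, $S_0 = 1$ assumed in the recurrence) lists all binary sequences of length n that end in 0 and satisfy (a)-(e), and confirms that their values $\sum a_i S_i$ are exactly $0, 1, \ldots, S_n - 1$.

```python
from itertools import product


def sarngadeva_numbers(upto):
    """List s with s[n] = S_n for 0 <= n <= upto, taking S_0 = 1."""
    s = [1]
    for n in range(1, upto + 1):
        s.append(sum(s[n - back] for back in (1, 2, 4, 6) if n - back >= 0))
    return s


def is_admissible(bits):
    """True if (a_1, ..., a_n) ends in 0 and satisfies conditions (a)-(e)."""
    text = "".join(str(b) for b in bits)
    if not text.endswith("0"):
        return False
    if any(pattern in text for pattern in ("111", "11011", "101011")):  # (a), (b), (c)
        return False
    return not text.startswith(("11", "10110"))  # (d), (e)


def admissible_values(n):
    """Sorted values sum(a_i * S_i) over all admissible sequences of length n."""
    s = sarngadeva_numbers(n)
    return sorted(
        sum(bit * s[i + 1] for i, bit in enumerate(bits))
        for bits in product((0, 1), repeat=n)
        if is_admissible(bits)
    )


for n in range(1, 8):
    assert admissible_values(n) == list(range(sarngadeva_numbers(n)[n]))
print(admissible_values(4))  # [0, 1, 2, 3, 4, 5]
```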


3.8. The mathematical basis of the nasta and uddista process


We enumerate the rows of a prastara of a total value n starting from the last row,
numbering the last row as 0, the penultimate row as 1, and so on. If the rank of a
row in this numbering is t, then the number of the row in the usual enumeration
(starting from the top and calling the top row as row number 1) is $S_n - t$. The
process of nasta consists in writing down explicitly the row (without the aid of
the prastara), if one is given the row number m in the enumeration we have just
described. If $m = 0$, then the row consists of n Ds. If $m < S_{n-1}$, then the
last symbol of the row is D because the last $S_{n-1}$ rows of the n-druta prastara
end with a D, by very definition. If $S_{n-1} \leq m < S_{n-1} + S_{n-2}$, then the
m-th row ends with an L. If $S_{n-1} + S_{n-2} \leq m < S_{n-1} + S_{n-2} + S_{n-4}$,
then the row ends with a G. If
$S_{n-1} + S_{n-2} + S_{n-4} \leq m < S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}$, then
the row ends with a P.

This leads one to the expectation that the canonical decomposition of m
($1 \leq m < S_n$) determines explicitly the sequence of symbols that occur in the
m-th row. To show this, we proceed inductively as follows.

(i) If the last symbol is D, that is $m < S_{n-1}$, then the penultimate symbol of
the m-th row is the last symbol of the m-th row in the prastara of value $n - 1$,
where the last row has $n - 1$ drutas.

(ii) If the m-th row ends with an L, that is $S_{n-1} \leq m < S_{n-1} + S_{n-2}$,
then the penultimate symbol of the m-th row is the last symbol of the
$(m - S_{n-1})$-th row of the prastara of value $n - 2$.

(iii) If the m-th row ends with a G, that is
$S_{n-1} + S_{n-2} \leq m < S_{n-1} + S_{n-2} + S_{n-4}$, then the penultimate
symbol of the m-th row is the last symbol of the $(m - (S_{n-1} + S_{n-2}))$-th row
of the prastara of value $n - 4$.

(iv) If the m-th row ends with a P, that is
$S_{n-1} + S_{n-2} + S_{n-4} \leq m < S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}$, then
the penultimate symbol of the m-th row is the last symbol of the
$(m - (S_{n-1} + S_{n-2} + S_{n-4}))$-th row of the prastara of value $n - 6$.


The next or the second penultimate symbol is then determined in the same way. 
And so on. This essentially establishes the following relation between the canon- 
ical decomposition of the row number m and the sequence of symbols that occur 
in the row. 

Proposition 4: Given any number $0 \leq m < S_n$, let the corresponding canonical
decomposition be obtained and re-expressed as a binary sequence of 0's and 1's
in the manner described earlier. Then, the symbols in the m-th row (enumerated
from below starting from the last row numbered as 0) in the prastara of value n
are as follows:

(i) If the sequence ends with (0, 0), then the last symbol is a D and the next
symbol is the last symbol of the row whose number is given by the truncated
binary sequence with the last 0 removed, in the prastara of value $n - 1$.

(ii) If the sequence ends with (0, 1, 0), then the last symbol is an L and the next
symbol is the last symbol of the row whose number is given by the truncated
binary sequence with the end portion (1, 0) removed, in the prastara of value
$n - 2$.

(iii) If the sequence ends with (0, 0, 1, 1, 0), then the last symbol is a G and the
next symbol is the last symbol of the row whose number is given by the
truncated binary sequence with the end portion (0, 1, 1, 0) removed, in the
prastara of value $n - 4$.

(iv) If the sequence ends with (0, 0, 1, 0, 1, 1, 0), then the last symbol is a P and
the next symbol is the last symbol of the row whose number is given by the
truncated binary sequence with the end portion (0, 1, 0, 1, 1, 0) removed, in
the prastara of value $n - 6$.


We now look at the converse process of uddista, where the objective is to
determine the row-number (in the enumeration given above) given the sequence of
symbols that occur in some row of the prastara of value n. Suppose the row-number
is m and we want to determine m. Suppose the last symbol is D; then $m < S_{n-1}$
and therefore $S_{n-1}$ does not occur in the canonical decomposition of m. If we
delete the last D, we obtain a row of value $n - 1$, whose row-number t in the
prastara (of value $n - 1$) can be determined by induction, and we have $m = t$.
Suppose the last symbol is L; then $S_{n-1}$ occurs in the decomposition for m, and
after deleting L, we get a row of value $n - 2$, whose row-number t in the prastara
(of value $n - 2$) can be determined by induction on n. We then set
$m = S_{n-1} + t$. If the last symbol of the m-th row is G, then $S_{n-1}$ and
$S_{n-2}$ occur in the decomposition of m, and if we delete this G, we obtain a row
of value $n - 4$, whose row-number t in the prastara (of value $n - 4$) can be
determined by induction, and we set $m = S_{n-1} + S_{n-2} + t$. If the last symbol
of the m-th row is P, then $S_{n-1}$, $S_{n-2}$ and $S_{n-4}$ occur in the
decomposition of m, and if we delete this P, we obtain a row of value $n - 6$,
whose row-number t in the prastara (of value $n - 6$) can be determined by
induction, and then we set $m = S_{n-1} + S_{n-2} + S_{n-4} + t$.
This is the process of uddista.
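The inductive descriptions of nasta and uddista given above translate directly into code. This is only a sketch with hypothetical function names; rows are numbered from the bottom starting at 0, as in this section, and $S_0 = 1$ is assumed in the recurrence.

```python
VALUES = {"D": 1, "L": 2, "G": 4, "P": 6}
ORDER = "DLGP"  # blocks of rows counted from the bottom: D, then L, G, P


def sarngadeva_numbers(upto):
    """List s with s[n] = S_n for 0 <= n <= upto, taking S_0 = 1."""
    s = [1]
    for n in range(1, upto + 1):
        s.append(sum(s[n - back] for back in (1, 2, 4, 6) if n - back >= 0))
    return s


def nasta_from_bottom(n, m):
    """Row of the n-druta-prastara whose rank from the bottom is m (0-based)."""
    s = sarngadeva_numbers(n)
    symbols = []
    while n > 0:
        offset = 0
        for symbol in ORDER:
            rest = n - VALUES[symbol]
            block = s[rest] if rest >= 0 else 0  # rows ending in this symbol
            if m < offset + block:
                symbols.append(symbol)
                m -= offset
                n = rest
                break
            offset += block
        else:
            raise ValueError("row number out of range")
    return "".join(reversed(symbols))


def uddista_from_bottom(row):
    """Rank from the bottom (0-based) of a row given as a string over D, L, G, P."""
    s = sarngadeva_numbers(sum(VALUES[x] for x in row))
    n = m = 0
    for symbol in row:
        n += VALUES[symbol]
        for other in ORDER:  # skip the blocks lying below this symbol's block
            if other == symbol:
                break
            if n - VALUES[other] >= 0:
                m += s[n - VALUES[other]]
    return m


print(nasta_from_bottom(7, 33 - 8))  # DDDLL (row 8 counted from the top)
print(uddista_from_bottom("PD"))     # 18, i.e. row 33 - 18 = 15 from the top
```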

We have outlined the theory of the canonical representation of a number as a
sum of Sarngadeva numbers (samkhyankas $S_n$), which constitutes the mathematical
basis for the nasta and uddista processes discussed by Sarngadeva. We shall
now outline the processes of nasta and uddista as described by Sarngadeva
himself. His rule of enumeration of tala-prastara also has an elegant mathematical
description if viewed in terms of the canonical decompositions associated with
two successive row-numbers. We shall discuss this in Appendix A.


3.9. Nasta process as discussed by Sarngadeva 


[Sanskrit verses of Sarngadeva on the nasta process]


The nasta process is outlined by Sarngadeva in the above verses, which may
be paraphrased as follows: In order to find the tala-form which is associated
with the row-number r (nastanka) of an n-druta-prastara, write the sequence of
samkhyankas $S_1, S_2, \ldots, S_n$. Subtract the rank-number from $S_n$. From the
result ($S_n - r$), subtract $S_{n-1}$ if possible. If $S_{n-1}$ can be subtracted,
it is called reducible (patita), and then subtract $S_{n-2}$ (if possible) from the
result ($S_n - r - S_{n-1}$) of this subtraction. If $S_{n-1}$ cannot be subtracted
from $S_n - r$, it is called irreducible (apatita), and then subtract $S_{n-2}$ (if
possible) from the number $S_n - r$. And so on.

Mark all the samkhyankas $S_1, S_2, \ldots, S_{n-1}$ as either patita or apatita.
$S_n$, being larger than $S_n - r$, is always apatita. Scanning the samkhyanka
sequence right to left from $S_n$, obtain the successive talangas of the tala-form
from right to left, according to the following procedure.

• We start with $S_n$, which is apatita. Then there are the following possibilities:

• If the following samkhyanka $S_{n-1}$ is also apatita, then write a D
corresponding to $S_n$ and proceed with scanning from $S_{n-1}$.

• If $S_{n-1}$ is patita and $S_{n-2}$ is apatita, then write an L corresponding to
the two samkhyankas $S_n$ and $S_{n-1}$ taken together, and proceed to scan from
$S_{n-2}$.

• If the samkhyankas $S_{n-1}$, $S_{n-2}$ are patita and $S_{n-3}$ and $S_{n-4}$ are
apatita, then write a G corresponding to the four samkhyankas $S_n$, $S_{n-1}$,
$S_{n-2}$ and $S_{n-3}$ taken together, and start scanning from $S_{n-4}$.

• If the samkhyankas $S_{n-1}$, $S_{n-2}$, $S_{n-4}$ are all patita and $S_{n-3}$,
$S_{n-5}$ and $S_{n-6}$ are apatita, then mark a P corresponding to the six
samkhyankas $S_n$, $S_{n-1}$, $S_{n-2}$, $S_{n-3}$, $S_{n-4}$ and $S_{n-5}$ taken
together, and start scanning from $S_{n-6}$.

• Given the way the samkhyankas are generated, the above four cases exhaust
all the possibilities.

• Proceeding thus, mark all the talangas from right to left.


In the above process of nasta for finding the tala-form associated with row r in
an n-druta-prastara, Sarngadeva essentially gives the procedure for constructing
what we have described as the canonical decomposition of the number $S_n - r$.
If we mark the patita samkhyankas by 1 and the apatita by 0, then following
the procedure of Sarngadeva, we obtain the binary sequence that is associated
with the number $S_n - r$ via its canonical decomposition. Then Sarngadeva's
prescription for identifying the talangas that occur in the tala-form appearing
in the r-th row is essentially the same as what we have stated as Proposition 4
above (except for the fact that we have used an enumeration of the rows where
the rows are numbered from 0 from below, while Sarngadeva uses the usual
enumeration of rows starting with row number 1 at the top of the prastara).

The signatures of various talangas as given by Sarngadeva are given in
Table 11. Here, "0" stands for apatita and "1" for patita. Further, "(0)" (zero
enclosed by brackets) indicates that the previous samkhyanka (if any) has to be
an apatita for the identification to take effect.


Examples

The 8th tala-form in the 7-druta-prastara

$33 - 8 = 25$, $25 - 19 = 6$, $6 - 6 = 0$.
Thus the patita and apatita samkhyankas are as follows:

0/1    |  0 |  0 |  0 |  1 |  0 |  1 |  0
$S_n$  |  1 |  2 |  3 |  6 | 10 | 19 | 33

• Starting from 33, since 19 is patita and 10 is apatita, we get a laghu at the
right extreme.

• Starting from 10, since 6 is patita and 3 is apatita, we get another laghu to
the left of the first.

• Starting from 3, since 2 is apatita, we get a druta to the left.

• Starting from 2, since 1 is apatita, we get one more druta to the left.

• Since 1 is apatita, we get one more druta.

Thus the tala-form is DDDLL.

The 28th tala-form in the 7-druta-prastara

33, 19, 10 and 6 are apatita and thus we get DDD from the right. Starting from
6, 3 and 2 are patita and 1 is apatita. They give a G, thus giving the tala-form
GDDD.

0/1    |  0 |  1 |  1 |  0 |  0 |  0 |  0
$S_n$  |  1 |  2 |  3 |  6 | 10 | 19 | 33
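Sarngadeva's nasta procedure, as paraphrased above, can be sketched as follows. The function names are hypothetical; rows are numbered from 1 at the top, and a samkhyanka that does not exist (index below 1) is treated as apatita, which is an assumption made to handle the left end of the sequence.

```python
def sarngadeva_numbers(upto):
    """List s with s[n] = S_n for 0 <= n <= upto, taking S_0 = 1."""
    s = [1]
    for n in range(1, upto + 1):
        s.append(sum(s[n - back] for back in (1, 2, 4, 6) if n - back >= 0))
    return s


def mark_patita(n, r):
    """patita[k] is True when S_k is subtracted while reducing S_n - r."""
    s = sarngadeva_numbers(n)
    if not 1 <= r <= s[n]:
        raise ValueError("row number out of range")
    remainder = s[n] - r
    patita = [False] * (n + 1)  # S_n itself always stays apatita
    for k in range(n - 1, 0, -1):
        if s[k] <= remainder:
            patita[k] = True
            remainder -= s[k]
    return patita


def nasta(n, r):
    """Tala-form in row r (counted from the top, starting at 1)."""
    patita = mark_patita(n, r)

    def is_patita(k):
        return k >= 1 and patita[k]

    symbols = []
    k = n
    while k >= 1:  # scan from S_n towards the left
        if not is_patita(k - 1):
            symbols.append("D")
            k -= 1
        elif not is_patita(k - 2):
            symbols.append("L")
            k -= 2
        elif not is_patita(k - 4):
            symbols.append("G")
            k -= 4
        else:
            symbols.append("P")
            k -= 6
    return "".join(reversed(symbols))


print(nasta(7, 8))   # DDDLL
print(nasta(7, 28))  # GDDD
```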


3.10. Uddista process as discussed by Sarngadeva 


[Sanskrit verses of Sarngadeva on the uddista process]


Given a tala-form associated with an n-druta-prastara, to find its rank-number
the uddista process is employed. In the above verses, Sarngadeva essentially
states that since uddista is the reverse of the nasta process, we write the sequence
of samkhyankas and identify the patitas among them which give rise to the given
sequence of talangas. This effectively leads to the following process, which has
also been discussed earlier in the general mathematical context.

We write the samkhyankas $1, 2, \ldots, S_n$ above the talangas from the left. We
write one samkhyanka above a D, two above an L, four above a G and six above
a P. Since the samkhyanka associated with a D is apatita, it is not marked. Since
the first samkhyanka associated with an L is patita and the second is apatita,
only the first is marked. Since only the second and third of the samkhyankas
associated with a G are patita, they are marked. Since only the second, fourth and
fifth samkhyankas associated with a P are patita, they are marked. The sum of
the patita or marked samkhyankas is subtracted from $S_n$ to get the rank-number
of the tala-form.


Examples

To find the rank of LDLL in 7-druta-prastara

 1   2 |  3 |  6  10 | 19  33
   L   |  D |    L   |    L

Total of the patita samkhyankas: $19 + 6 + 1 = 26$. Rank: $33 - 26 = 7$.

To find the rank of GDL in 7-druta-prastara

 1   2   3   6 | 10 | 19  33
       G       |  D |    L

Total of the patita samkhyankas: $19 + 3 + 2 = 24$. Rank: $33 - 24 = 9$.

To find the rank of PD in 7-druta-prastara

 1   2   3   6  10  19 | 33
           P           |  D

Total of the patita samkhyankas: $10 + 6 + 2 = 18$. Rank: $33 - 18 = 15$.

To find the rank of DDLLD in 7-druta-prastara

 1 |  2 |  3   6 | 10  19 | 33
 D |  D |    L   |    L   |  D

Total of the patita samkhyankas: $10 + 3 = 13$. Rank: $33 - 13 = 20$.
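The uddista rule can be sketched in the same spirit. The names and the two lookup tables are hypothetical encodings of the rule stated above: each talanga takes 1, 2, 4 or 6 consecutive samkhyankas, and the listed positions within that group are the patita ones.

```python
SPAN = {"D": 1, "L": 2, "G": 4, "P": 6}
# Positions (counted from 1 within the group) of the patita samkhyankas.
PATITA_POSITIONS = {"D": (), "L": (1,), "G": (2, 3), "P": (2, 4, 5)}


def sarngadeva_numbers(upto):
    """List s with s[n] = S_n for 0 <= n <= upto, taking S_0 = 1."""
    s = [1]
    for n in range(1, upto + 1):
        s.append(sum(s[n - back] for back in (1, 2, 4, 6) if n - back >= 0))
    return s


def uddista(row):
    """Rank (from the top, starting at 1) of a tala-form such as 'LDLL'."""
    n = sum(SPAN[symbol] for symbol in row)
    s = sarngadeva_numbers(n)
    position = 0       # index of the last samkhyanka already used
    patita_total = 0
    for symbol in row:
        for offset in PATITA_POSITIONS[symbol]:
            patita_total += s[position + offset]
        position += SPAN[symbol]
    return s[n] - patita_total


for form in ("LDLL", "GDL", "PD", "DDLLD"):
    print(form, uddista(form))  # ranks 7, 9, 15 and 20 respectively
```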


3.11. Patala 


[Sanskrit verses of Sarngadeva on the patala]


The patala sequence $T_1, T_2, \ldots, T_n, \ldots$ is defined inductively as
follows: First write 1 ($S_0 = 1$) followed by the sequence of samkhyankas
$S_1, S_2, \ldots$, etc. Below these the patala sequence is to be set down. The
n-th patala number ($T_n$) is obtained by adding the previous samkhyanka
($S_{n-1}$) to the previous, second-previous, fourth-previous and the
sixth-previous patala numbers whenever they exist. That is,

$$T_n = S_{n-1} + T_{n-1} + T_{n-2} + T_{n-4} + T_{n-6}. \tag{20}$$

In the case of the patala sequence, there is no consideration of any representative
in case the fourth or sixth previous elements do not exist (which was done in the
case of the samkhyanka sequence).[10]

The patala numbers $T_n$, $T_{n-1}$, $T_{n-3}$ and $T_{n-5}$ give the total number
of D, L, G and P which occur in the (various tala-forms which are included in the)
n-druta prastara. We can easily deduce this fact from the inductive construction
of the prastara outlined earlier. For instance, let $D_n$ be the total number of
drutas in the n-druta prastara. The last $S_{n-1}$ rows, which end in a D, have
$S_{n-1} + D_{n-1}$ drutas. The number of drutas in the rows which end in L, G and
P are $D_{n-2}$, $D_{n-4}$ and $D_{n-6}$ respectively. Hence, $D_n$ satisfy the
recurrence relation

$$D_n = S_{n-1} + D_{n-1} + D_{n-2} + D_{n-4} + D_{n-6},$$

which is the same as the recurrence relation (20) satisfied by the patala numbers
$T_n$. The equality of $D_n$ and $T_n$ follows from this and the fact that
$D_i = T_i$ for small values of i.

Further,

$$M_n = T_n + T_{n-1} + T_{n-3} + T_{n-5} = T_{n+1} - S_n \tag{21}$$

gives what is sometimes called the maha-patala, which is the total number of all
talangas (D, L, G and P) which appear in the n-druta prastara.

In Table 12, we give the patala sequence. Here, $T_0 = 0$, $T_1 = 1$,
$T_2 = 1+1 = 2$, $T_3 = 2+2+1 = 5$, $T_4 = 3+5+2 = 10$, $T_5 = 6+10+5+1 = 22$,
$T_6 = 10+22+10+2 = 44$, $T_7 = 19+44+22+5+1 = 91$, and so on.

[10] In other words, we set $T_i = 0$ for $i \leq 0$.
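The patala recurrence (20) and its interpretation as a count of talangas can be checked against a brute-force enumeration. The sketch below uses hypothetical names, takes $S_0 = 1$ and $T_0 = 0$, and lists the tala-forms by the inductive construction of Section 3.5.

```python
def sarngadeva_numbers(upto):
    """List s with s[n] = S_n for 0 <= n <= upto, taking S_0 = 1."""
    s = [1]
    for n in range(1, upto + 1):
        s.append(sum(s[n - back] for back in (1, 2, 4, 6) if n - back >= 0))
    return s


def patala_numbers(upto):
    """List t with t[n] = T_n for 0 <= n <= upto, using relation (20)."""
    s = sarngadeva_numbers(upto)
    t = [0]  # T_0 = 0
    for n in range(1, upto + 1):
        t.append(s[n - 1] + sum(t[n - back] for back in (1, 2, 4, 6) if n - back >= 0))
    return t


def tala_forms(n):
    """All rows of the n-druta-prastara (brute-force enumeration)."""
    if n == 0:
        return [""]
    forms = []
    for symbol, value in (("P", 6), ("G", 4), ("L", 2), ("D", 1)):
        if n - value >= 0:
            forms += [row + symbol for row in tala_forms(n - value)]
    return forms


t = patala_numbers(9)
print(t)  # [0, 1, 2, 5, 10, 22, 44, 91, 180, 358]

rows = tala_forms(7)
counts = {x: sum(row.count(x) for row in rows) for x in "DLGP"}
print(counts)  # {'D': 91, 'L': 44, 'G': 10, 'P': 2}, i.e. T_7, T_6, T_4, T_2
print(sum(counts.values()), t[8] - sarngadeva_numbers(7)[7])  # maha-patala M_7, both 147
```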
TABLE 12. Patala sequence


3.12. Druta-meru


[Sanskrit verses of Sarngadeva on the druta-meru]


After considering the pratyaya patala, Sarngadeva begins the discussion of the
pratyayas druta-meru, laghu-meru, guru-meru and pluta-meru, which are
analogous to the pratyaya eka-dvayadi-laga-kriya for matra-vrttas. The
druta-meru is to be constructed to compute the number of tala-forms in a prastara
which have a given number of drutas.

While considering an n-druta prastara, start with a first row which has n
entries. The row above will not have any entry in the first column, and the
successive rows after that will have two columns less at each stage. In each row,
the first two entries from the left will be 1, 1. In the first row, the odd places
are filled with the sum of the previous, second-previous, fourth-previous and the
sixth-previous entries, as in the case of samkhyankas. The even places are to be
filled with the same sum except for the previous entry. Where the fourth or
sixth-previous entry is not there, the third or the fifth-previous entry is to be
used as the representative. In the other rows, the odd places are filled with the
sum of the previous, second-previous, fourth-previous and the sixth-previous
entries, as in the case of samkhyankas. The even places are to be filled with the
same sum except that the previous entry is to be taken from the row below. Where
the fourth or sixth-previous entry is not there, no representative is to be used
in their place.

TABLE 13. Druta-meru

The different columns correspond to prastaras of different values which
are indicated in the bottom row. The entries in the successive rows above
the bottom row in the n-th column give the number of tala-forms which
have 0, 2, 4, etc., drutas respectively when n is even, and the number of
tala-forms which have 1, 3, 5, etc., drutas respectively when n is odd.

Then, the entries read from below in the n-th column give the number of
tala-forms in the n-druta prastara with 1, 3, 5, etc., drutas respectively, in the
case when n is odd; they give the number of tala-forms in the n-druta prastara with
0, 2, 4, etc., drutas respectively, in the case when n is even.[11] Notice that the
sum of all the entries in any column is always equal to the corresponding
samkhyanka.

In Table 13, a bottom (or zeroth) row is introduced below the first row to denote
the total value of the prastara. The rows of the meru are filled as follows.

• First row odd entries: $1 = 1$, $2 = 1+1$, $5 = 2+2+1$, $12 = 4+5+2+1$,
$26 = 7+12+5+2$

• First row even entries: $1 = 1$, $2 = 1+1$, $4 = 2+1+1$, $7 = 4+2+1$
(in the second and third sums, 1 is the pratinidhi or the representative)

• Second row odd entries: $4 = 3+1$, $14 = 9+4+1$, $44 = 25+14+4+1$

• Second row even entries: $3 = 2+1$, $9 = 5+3+1$, $25 = 12+9+3+1$

[11] Since L, G and P correspond to kala-pramanas 2, 4 and 6, which are all even, an
n-druta prastara will have rows which contain only an odd number of Ds when n is
odd, and only an even number of Ds when n is even.


For example, in the 5-druta prastara, there are 5 tala-forms with 1 druta, 4
with 3 drutas, and 1 with 5 drutas. In the 6-druta prastara, there are 4
tala-forms with 0 drutas, 9 with 2 drutas, 5 with 4 drutas and 1 with 6 drutas. In
the 7-druta prastara, there are 12 tala-forms with 1 druta, 14 with 3 drutas, 6
with 5 drutas and 1 with 7 drutas.

The above construction of the druta-meru is in fact a generalisation of the
construction of the matra-meru used in the eka-dvayadi-lagakriya for the
matra-vrttas in the Prakrta Paingala (in Prakrta, c. 1300) or the commentary
Narayani of Narayanabhatta (c. 1555) on Vrttaratnakara of Kedarabhatta. There, the
meru is constructed essentially by a recurrence relation of the type
$g_k^n = g_k^{n-1} + g_{k-1}^{n-2}$, where $g_k^n$ is the number of metrical forms
in the prastara of n-matras with k gurus. These can be shown to be related to the
binomial coefficients, and their sum for any given n will be related to the
samkhyankas or the Virahanka-Fibonacci numbers.[12]

Let $D_k^n$ denote the number of tala-forms with k drutas in the n-druta prastara.
If we refer back to the inductive construction of the prastara explained earlier,
we can easily see that of these tala-forms, $D_{k-1}^{n-1}$ will be the number of
tala-forms which end in a druta, while $D_k^{n-2}$, $D_k^{n-4}$ and $D_k^{n-6}$
will be the number of tala-forms which end in a laghu, guru and pluta,
respectively. Thus we have the recurrence relation:[13]

$$D_k^n = D_{k-1}^{n-1} + D_k^{n-2} + D_k^{n-4} + D_k^{n-6} \quad (\text{for } n > 6 \text{ and } k > 0). \tag{22}$$

The above recursion relation satisfied by $D_k^n$ is in fact the basis for the
above construction of the druta-meru.
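The recurrence for the number of tala-forms with a given number of drutas can be evaluated directly. In this sketch the names are hypothetical, and the boundary convention (one empty form of value 0, zero outside the admissible range) is an assumption that plays the role of keeping only the meaningful terms for small n.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def druta_count(n, k):
    """Number of tala-forms of value n that contain exactly k drutas."""
    if n < 0 or k < 0:
        return 0
    if n == 0:
        return 1 if k == 0 else 0  # assumption: one empty form of value 0
    return (
        druta_count(n - 1, k - 1)  # forms ending in a druta
        + druta_count(n - 2, k)    # forms ending in a laghu
        + druta_count(n - 4, k)    # forms ending in a guru
        + druta_count(n - 6, k)    # forms ending in a pluta
    )


def druta_meru_column(n):
    """Column n of the meru: counts for k = n mod 2, n mod 2 + 2, ..., n."""
    return {k: druta_count(n, k) for k in range(n % 2, n + 1, 2)}


for n in (5, 6, 7):
    column = druta_meru_column(n)
    print(n, column, sum(column.values()))
# 5 {1: 5, 3: 4, 5: 1} 10
# 6 {0: 4, 2: 9, 4: 5, 6: 1} 19
# 7 {1: 12, 3: 14, 5: 6, 7: 1} 33
```

Each column sums to the corresponding samkhyanka, in line with the remark that the entries of a column of the meru add up to $S_n$.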

We shall now outline an alternate derivation of the above relation (22) using the 
generating function approach. For this purpose, we first note that the coefficient 
of tx” in (tx + x? + x^ + x9)? gives the number of possible tala-forms of value 
(kala-pramana) n with p talarigas (i.e., D, L, G or P) of which k are drutas (D). 
‘Summing over all the possibilities, we get 


[1 — (tx +x? - x* +x)! = 25 È oe | wee (23) 


12For details, we refer the reader to the mathematical theory of matrà-vrtta-prastara outlined in 
the paper of R. Sridharan, cited earlier. 

13For | € n € 6, or k — 0, we only need to take those terms which are meaningful in the 
recurrence relation (22). 
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We therefore have 


[1 — (rx x? + x* x5) >| 07 pe =] (24) 
n 


cn 


Comparing the coefficients of $t^k x^n$ on both sides of the above equation, we obtain 
the recurrence relation (22). Incidentally, by setting $t = 1$ in (23), we obtain the 
obvious relation 


$$\sum_{k \le n} D^n_k = S_n \tag{25}$$
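
The coefficient comparison that leads from (24) to (22) can be written out in one line. The following is only a sketch, under the assumed convention (ours, not stated in the text) that $D^0_0 = 1$ and that $D^n_k = 0$ whenever $n < 0$, $k < 0$ or $k > n$:

```latex
\begin{align*}
\bigl[1 - (tx + x^2 + x^4 + x^6)\bigr] \sum_{n}\sum_{k} D^n_k \, t^k x^n
  = \sum_{n}\sum_{k}
    \bigl( D^n_k - D^{n-1}_{k-1} - D^{n-2}_{k} - D^{n-4}_{k} - D^{n-6}_{k} \bigr)\, t^k x^n .
\end{align*}
```

Since the product equals 1, the bracket on the right must vanish for every $n > 0$, which is relation (22); with this convention the terms that are "not meaningful" for small n simply drop out.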


The recurrence relation (22) is not very transparent in the form in which the 
druta-meru of Sarngadeva is laid out in Table 13, because the same row corresponds 
to k or (k + 1) drutas in alternate columns. In Appendix C, we shall display an 
alternative form of druta-meru proposed by Akella Mallikarjuna Sarma, which 
clearly reflects the above recurrence relation. 
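
The way relation (22) generates the entries of a druta-meru can be sketched in a few lines of Python. This is an illustration only, not Sarngadeva's tabular procedure: the names `druta_count` and `samkhyanka` are ours, and we assume that the empty form is the only form of value 0 and that terms with a negative index count as zero.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def druta_count(n, k):
    """Number of tala-forms of value n with exactly k drutas, via relation (22)."""
    if n < 0 or k < 0:
        return 0                      # terms that are not meaningful contribute nothing
    if n == 0:
        return 1 if k == 0 else 0     # assumed convention: only the empty form has value 0
    return (druta_count(n - 1, k - 1)   # forms ending in a druta
            + druta_count(n - 2, k)     # forms ending in a laghu
            + druta_count(n - 4, k)     # forms ending in a guru
            + druta_count(n - 6, k))    # forms ending in a pluta


def samkhyanka(n):
    """Sum of the n-th column of the meru, as in relation (25)."""
    return sum(druta_count(n, k) for k in range(n + 1))


if __name__ == "__main__":
    # One row per number of drutas k, one column per value n = 0, ..., 13.
    for k in range(7, -1, -1):
        print(k, [druta_count(n, k) for n in range(14)])
```

Each row printed by the script corresponds to a fixed number of drutas, so entries alternate with zeros according to the parity of n.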


3.13. Laghu-meru 


[Sanskrit verses on the laghu-meru from Chapter V of Sangitaratnakara] 


The laghu-meru is to be constructed to compute the number of tala-forms in a 
prastara which have a given number of laghus. 

As in the druta-meru, while considering an n-druta prastara, start with a 
row which has n entries. The row above will not have any entry in the first column; 
the successive rows after that will have two columns less at each stage. In each 
row, the first entry from the left will be 1. In the first row, the places are filled 
with the sum of the previous, the fourth-previous and the sixth-previous entries 
in the same row. Where the fourth or sixth-previous entry is not there, the third 
or the fifth-previous entry is to be used as the representative. In the other rows, 
the places are filled with the sum of the previous, fourth-previous and the 
sixth-previous entries in the same row, and the second-previous entry in the row below. 
Where the fourth or sixth-previous entry is not there, no representative is to be 
used in their place. 

                                                 1
                                       1    5   15
                             1    4   10   20   39
                   1    3    6   10   18   33   61
         1    2    3    4    7   12   21   34   54
    1    1    1    2    3    5    7   10   14   21
    ------------------------------------------------
    1    2    3    4    5    6    7    8    9   10


TABLE 14. Laghu-meru 


The different columns correspond to prastaras of different values which 
are indicated in the bottom row. The entries in the successive rows above 
the bottom row in the n-th column give the number of tala-forms which 
have 0, 1, 2 etc., laghus respectively. 


Then, the entries read from the bottom in the n-th column give the number 
of tala-forms in the n-druta prastara with 0, 1, 2, 3, etc., laghus respectively. 
The sum of all the entries in any column is always equal to the corresponding 
samkhyanka. 

In Table 14, a bottom (or zeroth) row is introduced below the first row to denote 
the total value of the prastara. The rows of the meru are filled as follows. 

First row: 1 = 1, 1 = 1, 1 = 1, 2 = 1 + 1, 3 = 2 + 1, 5 = 3 + 1 + 1, 
7 = 5 + 1 + 1, 10 = 7 + 2 + 1, 14 = 10 + 3 + 1, 21 = 14 + 5 + 2 (in the fourth and 
sixth entry, 1 is the pratinidhi). 

Second row: 2 = 1 + 1, 3 = 2 + 1, 4 = 3 + 1, 7 = 4 + 2 + 1, 12 = 7 + 3 + 2, 
21 = 12 + 5 + 3 + 1, 34 = 21 + 7 + 4 + 2, 54 = 34 + 10 + 7 + 3. 

For example, in the 6-druta prastara, there are 5 tala-forms with 0 laghus, 7 
with 1 laghu, 6 with 2 laghus and 1 with 3 laghus. In the 7-druta prastara, there 
are 7 tala-forms with 0 laghus, 12 with 1 laghu, 10 with 2 laghus and 4 with 3 
laghus. 

If $L^n_k$ denotes the number of tala-forms with k laghus in the n-druta prastara, 
then we can easily see that we have the following relation analogous to (23): 


$$[1 - (x + tx^2 + x^4 + x^6)]^{-1} = \sum_{n} \sum_{k \le n} L^n_k \, t^k x^n \tag{26}$$


From (26) we obtain the following recurrence relation 


$$L^n_k = L^{n-1}_k + L^{n-2}_{k-1} + L^{n-4}_k + L^{n-6}_k \qquad (\text{for } n > 6 \text{ and } k > 0), \tag{27}$$



which is clearly the basis for the construction of the laghu-meru as described 
above. 
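
The entries of the laghu-meru quoted in the examples of this section can be checked independently of the recurrence by listing the tala-forms outright. The sketch below does this by brute force; the names `VALUES`, `tala_forms` and `laghu_column` are hypothetical, and the durations of 1, 2, 4 and 6 drutas for D, L, G and P are those implied by the recurrence relations.

```python
from collections import Counter

# Assumed durations of the four talangas in druta units.
VALUES = {"D": 1, "L": 2, "G": 4, "P": 6}


def tala_forms(n):
    """Yield every tala-form of total value n as a string over D, L, G, P."""
    if n == 0:
        yield ""
        return
    for symbol, value in VALUES.items():
        if value <= n:
            # Choose the last talanga, then enumerate what precedes it.
            for rest in tala_forms(n - value):
                yield rest + symbol


def laghu_column(n):
    """Column n of the laghu-meru: entry k counts the forms with exactly k laghus."""
    counts = Counter(form.count("L") for form in tala_forms(n))
    return [counts[k] for k in range(max(counts) + 1)]


if __name__ == "__main__":
    for n in range(1, 11):
        print(n, laghu_column(n))
```

Enumeration grows quickly with n, so this is meant only as a check for small prastaras; the meru itself is the efficient method.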


3.14. Guru-meru 


[Sanskrit verses on the guru-meru from Chapter V of Sangitaratnakara] 


The guru-meru is to be constructed to compute the number of tala forms in a 
prastara which have a given number of gurus. 

While considering an n-druta prastara, start with a row which has n entries. 
The row above will have three columns less and the subsequent rows will have four 
columns less at each stage. In each row, the first entry from the left will be 1. In the 
first row, place 2 at the second place. The other places are filled with the sum of the 
previous, second-previous and the sixth-previous entries. When the sixth-previous 
entry is not there, the fifth-previous entry is to be used as the representative. In 
the other rows, the places are filled with the sum of the previous, second-previous 
and the sixth-previous entries in the same row, and the fourth-previous entry in the 
row below. Where the sixth-previous entry is not there, no representative is to be 
used in its place. 

Then, the entries read from the bottom in the n-th column give the number 
of tala-forms in the n-druta prastara with 0, 1, 2, 3, etc., gurus respectively. 
The sum of all the entries in any column is always equal to the corresponding 
samkhyanka. 

In Table 15, a bottom (or zeroth) row is introduced below the first row to denote 
the total value of the prastara. The rows of the meru are filled as follows: 

First row: 1 = 1, 2 = 2, 3 = 2 + 1, 5 = 3 + 2, 8 = 5 + 3, 14 = 8 + 5 + 1, 
23 = 14 + 8 + 1, 39 = 23 + 14 + 2, 65 = 39 + 23 + 3, 109 = 65 + 39 + 5 
(in the sixth entry 1 is the pratinidhi). 

Second row: 2 = 1 + 1, 5 = 2 + 1 + 2, 10 = 5 + 2 + 3, 20 = 10 + 5 + 5, 
38 = 20 + 10 + 8, 73 = 38 + 20 + 14 + 1. 

For example, in the 6-druta prastara, there are 14 tala-forms with 0 gurus and 
5 with 1 guru. In the 9-druta prastara, there are 65 tala-forms with 0 gurus, 38 
with 1 guru and 3 with 2 gurus. 


                                       1    3    9
                   1    2    5   10   20   38   73
    1    2    3    5    8   14   23   39   65  109
    ------------------------------------------------
    1    2    3    4    5    6    7    8    9   10


TABLE 15. Guru-meru 


The different columns correspond to prastaras of different values which 
are indicated in the bottom row. The entries in the successive rows above 
the bottom row in the n-th column give the number of tala-forms which 
have 0, 1, 2 etc., gurus respectively. 


If $G^n_k$ denotes the number of tala-forms with k gurus in the n-druta prastara, 
then we can easily see that we have the following relation analogous to (23): 


$$[1 - (x + x^2 + tx^4 + x^6)]^{-1} = \sum_{n} \sum_{k \le n} G^n_k \, t^k x^n \tag{28}$$


From (28) we obtain the following recurrence relation 

$$G^n_k = G^{n-1}_k + G^{n-2}_k + G^{n-4}_{k-1} + G^{n-6}_k \qquad (\text{for } n > 6 \text{ and } k > 0), \tag{29}$$


which is clearly the basis for the construction of the guru-meru as described 
above. 
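
The generating function for the guru-meru can also be expanded directly, which gives a check on the rows listed in this section. The following sketch sums the powers of $x + x^2 + tx^4 + x^6$ up to a chosen order in x; the representation of a polynomial as a dictionary keyed by (power of x, power of t) and the names `poly_mul` and `guru_series` are our own choices.

```python
def poly_mul(a, b, max_n):
    """Multiply two polynomials in x and t, dropping every term of x-degree > max_n.

    A polynomial is stored as {(n, k): coefficient} for the monomial t^k x^n.
    """
    out = {}
    for (n1, k1), c1 in a.items():
        for (n2, k2), c2 in b.items():
            if n1 + n2 <= max_n:
                key = (n1 + n2, k1 + k2)
                out[key] = out.get(key, 0) + c1 * c2
    return out


def guru_series(max_n):
    """Coefficients of t^k x^n in the sum over p of (x + x^2 + t x^4 + x^6)^p."""
    base = {(1, 0): 1, (2, 0): 1, (4, 1): 1, (6, 0): 1}   # t marks each guru
    total = {(0, 0): 1}    # the p = 0 term
    power = {(0, 0): 1}
    # The p-th power has x-degree at least p, so p <= max_n is enough.
    for _ in range(max_n):
        power = poly_mul(power, base, max_n)
        for key, c in power.items():
            total[key] = total.get(key, 0) + c
    return total


if __name__ == "__main__":
    G = guru_series(10)
    print([G.get((n, 0), 0) for n in range(1, 11)])   # forms with no guru
    print([G.get((n, 1), 0) for n in range(4, 11)])   # forms with one guru
```

Moving the marker t to another term of `base` gives the corresponding series for drutas, laghus or plutas.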


3.15. Pluta-meru 


[Sanskrit verses on the pluta-meru from Chapter V of Sangitaratnakara] 


The pluta-meru is to be constructed to compute the number of tāla forms in a 
prastāra which have a given number of plutas. 

While considering an n-druta prastara, start with a row which has n entries. 
The row above will have five columns less and the subsequent rows will have 
six columns less at each stage. In each row, the first entry from the left will be 
1. In the first row, place 2 at the second place. The other places are filled with 
the sum of the previous, second-previous and the fourth-previous entries. When 
the fourth-previous entry is not there, the third-previous entry is to be used as 
the representative. In the other rows, the places are filled with the sum of the 
previous, second-previous, and the fourth-previous entries in the same row, and 
the sixth-previous entry in the row below. Where the fourth-previous entry is not 
there, no representative is to be used in its place. The sum of all the entries in any 
column is always equal to the corresponding samkhyanka. 


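                             1    2    5   10   22   44
    1    2    3    6   10   18   31   55   96  169  296
    -----------------------------------------------------
    1    2    3    4    5    6    7    8    9   10   11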


TABLE 16. Pluta-meru 


The different columns correspond to prastaras of different values which 
are indicated in the bottom row. The entries in the successive rows above 
the bottom row in the n-th column give the number of tala-forms which 
have 0, 1, 2 etc., plutas respectively. 


In Table 16, a bottom (or zeroth) row is introduced below the first row to denote 
the total value of the prastara. The rows of the meru are filled as follows. 

First row: 1 = 1, 2 = 2, 3 = 2 + 1, 6 = 3 + 2 + 1, 10 = 6 + 3 + 1, 
18 = 10 + 6 + 2, 31 = 18 + 10 + 3, 55 = 31 + 18 + 6, 96 = 55 + 31 + 10, 
169 = 96 + 55 + 18, 296 = 169 + 96 + 31 (in the fourth entry 1 is the pratinidhi). 

Second row: 2 = 1 + 1, 5 = 2 + 1 + 2, 10 = 5 + 2 + 3, 22 = 10 + 5 + 1 + 6, 
44 = 22 + 10 + 2 + 10. 

For example, in the 6-druta prastara, there are 18 tala-forms with 0 plutas and 
1 with 1 pluta. In the 9-druta prastara, there are 96 tala-forms with 0 plutas, and 
10 with 1 pluta. 
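
The row-filling rule for the pluta-meru described above can be followed literally in code. The sketch below is our reading of that rule (the function name `pluta_meru` and the use of one dictionary per row are assumptions); in particular it applies the pratinidhi only in the first row, as the text prescribes.

```python
def pluta_meru(n_max):
    """Rows of the pluta-meru as dicts {column n: entry}; rows[k] counts forms with k plutas."""
    rows = []
    k = 0
    while 6 * k <= n_max:
        first_col = max(1, 6 * k)        # row k starts at column 6k (column 1 for the first row)
        row = {}
        for n in range(first_col, n_max + 1):
            if n == first_col:
                row[n] = 1               # the first entry of every row is 1
            elif k == 0 and n == 2:
                row[n] = 2               # "place 2 at the second place"
            else:
                total = row.get(n - 1, 0) + row.get(n - 2, 0)
                if n - 4 in row:
                    total += row[n - 4]              # fourth-previous entry
                elif k == 0 and n - 3 in row:
                    total += row[n - 3]              # pratinidhi, first row only
                if k > 0:
                    total += rows[k - 1].get(n - 6, 0)   # sixth-previous entry, row below
                row[n] = total
        rows.append(row)
        k += 1
    return rows


if __name__ == "__main__":
    for row in reversed(pluta_meru(11)):
        print(row)
```

The same skeleton, with the offsets 2, 4 and 6 permuted, would follow the corresponding rules given for the laghu-meru and the guru-meru.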

If $P^n_k$ denotes the number of tala-forms with k plutas in the n-druta prastara, 
then we can easily see that we have the following relation analogous to (23): 


$$[1 - (x + x^2 + x^4 + tx^6)]^{-1} = \sum_{n} \sum_{k \le n} P^n_k \, t^k x^n \tag{30}$$


From (30) we obtain the following recurrence relation 

$$P^n_k = P^{n-1}_k + P^{n-2}_k + P^{n-4}_k + P^{n-6}_{k-1} \qquad (\text{for } n > 6 \text{ and } k > 0), \tag{31}$$


which is clearly the basis for the construction of the pluta-meru as described 
above. 


3.16. Other Pratyayas 


After the discussion of the druta, laghu, guru and pluta-merus, Sarngadeva goes 
on to consider ten more pratyayas, namely the samyoga-meru, khanda-prastara 
and the nasta and uddista processes associated with the druta, laghu, guru and 
pluta-merus. We shall here merely indicate the nature of these pratyayas, which 
have been dealt with in considerable detail by Sarngadeva in over fifty verses 
(verses 357-409) of Chapter V of Sangitaratnakara. 

The samyoga-meru gives the number of tala-forms in a prastara which have 
only D, or only L, or only G, or only P, or only D and L, or only D and G, and 
so on. This involves a consideration of all the 15 such combinations involving D, 
L, G and P—taken individually, in pairs, in triples and all the four together. 
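
The quantities tabulated by the samyoga-meru can be illustrated by brute force. The sketch below is not Sarngadeva's construction; it simply enumerates the tala-forms and groups them by the set of talangas they contain, on the assumption that "only D and L" means that both D and L occur and nothing else does. The names `tala_forms` and `samyoga_counts` are hypothetical.

```python
from collections import Counter

# Assumed durations of the four talangas in druta units.
VALUES = {"D": 1, "L": 2, "G": 4, "P": 6}


def tala_forms(n):
    """Yield every tala-form of total value n as a string over D, L, G, P."""
    if n == 0:
        yield ""
        return
    for symbol, value in VALUES.items():
        if value <= n:
            for rest in tala_forms(n - value):
                yield rest + symbol


def samyoga_counts(n):
    """Map each combination of talangas (e.g. 'DL') to the number of forms using exactly it."""
    counts = Counter()
    for form in tala_forms(n):
        used = "".join(s for s in "DLGP" if s in form)
        counts[used] += 1
    return counts


if __name__ == "__main__":
    for combination, count in sorted(samyoga_counts(6).items()):
        print(combination, count)
```

For a prastara of value n the counts over the 15 combinations add up to the samkhyanka, since every form uses exactly one combination.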

The next pratyaya, khanda-prastara, is in fact a sub-prastara of a given n- 
druta prastara, and has only those tala-forms which conform to given specifica- 
tion, such as having no drutas, or one druta, or two drutas, etc., and similarly for 
laghus, gurus and plutas. Finally, the processes of nasta and uddista, for such 
a khanda-prastara, are discussed by Sarngadeva in terms of the corresponding 
merus. 


Appendices 


Appendix A. Canonical decomposition and the rule of enumeration of 
Tala-prastara 


In this Appendix, we shall explain how Sarngadeva's rule of enumeration of the 
tala-prastara has an elegant mathematical description if viewed in terms of what 
we have referred to as the canonical decomposition of a row number as a sum of 
the samkhyankas $S_n$ or Sarngadeva numbers. We have already noted earlier (in 
our discussion of the nasta process) that the m-th row in the n-druta-prastara is 
obtained by taking the canonical decomposition of $S_n - m$, writing 1 for all the 
patita numbers ($< S_n$) and 0 for the others, substituting P for the string 010110, 
G for 0110, L for 10 and D for the string consisting of the single symbol 0. The 
next lemma shows that for m = 1, i.e., for the number $S_n - 1$, this actually leads 
to the first row of the prastara as described by Sarngadeva. 


Lemma: The canonical decomposition of $S_n - 1$ leads via the process of nasta 
to the first row of the n-druta-prastara, which has the maximum number of Ps at 
the extreme right, with a DG, G, DL, L or D to their left according as 
$n \equiv 5, 4, 3, 2$ or $1$ (modulo 6), respectively. The first row has only Ps if 
$n \equiv 0$ (modulo 6). 


Proof: If n = 1, $S_n - 1 = 0$ and the first row of the 1-druta-prastara is just D. 
If n = 2, $S_n - 1 = 1 = 1 \cdot S_1 + 0 \cdot S_2$ and hence the first row of the 
2-druta-prastara corresponds to the sequence 10 and is therefore an L. If n = 3, 
$S_n - 1 = 2 = 0 \cdot S_1 + 1 \cdot S_2 + 0 \cdot S_3$, hence the first row of the 
3-druta-prastara corresponds to the sequence 010 and is hence DL. If n = 4, 
$S_n - 1 = 6 - 1 = 5 = 0 \cdot S_1 + 1 \cdot S_2 + 1 \cdot S_3 + 0 \cdot S_4$, hence the 
first row of the 4-druta-prastara corresponds to the sequence 0110 and is hence G. 
If n = 5, $S_n - 1 = 10 - 1 = 9 = 0 \cdot S_1 + 0 \cdot S_2 + 1 \cdot S_3 + 1 \cdot S_4 + 0 \cdot S_5$, 
hence the first row of the 5-druta-prastara corresponds to the sequence 00110 or 
DG. If n = 6, $S_n - 1 = 19 - 1 = 18 = 0 \cdot S_1 + 1 \cdot S_2 + 0 \cdot S_3 + 1 \cdot S_4 + 1 \cdot S_5 + 0 \cdot S_6$, 
hence the first row of the 6-druta-prastara corresponds to the sequence 010110 
and is hence P. 

Let now n > 6. We have 


$$S_n = S_{n-6} + S_{n-4} + S_{n-2} + S_{n-1}$$

so that 

$$S_n - 1 = (S_{n-6} - 1) + S_{n-4} + S_{n-2} + S_{n-1} \tag{32}$$


with $n - 6 > 0$. We may assume by induction on n that the canonical decomposition 
for $S_{n-6} - 1$ leads to the first row of the (n - 6)-druta-prastara. In view 
of the above equation, the row corresponding to the canonical decomposition of 
$S_n - 1$ is obtained by adding only one more P to the right of the first row of the 
(n - 6)-druta-prastara and our lemma is thus proved.¹⁴ 

We shall now see that the above lemma can be used to relate the rule of Sarngadeva 
for passing from the m-th row of the n-druta-prastara to the (m + 1)-th row, 
via the change in the canonical decomposition of $S_n - m$ to that of $S_n - (m + 1)$. 

Suppose that the first non-druta element (from the left) of the m-th row is a P. 
Then the canonical decomposition of $S_n - m$ has the binary form 00...010110.... 
Let us assume that the first 1 in the above string occurs at the k-th place (counted 
from the left), i.e., that the canonical decomposition of $S_n - m$ begins with $S_k$. 
Therefore, in the canonical decomposition of $S_n - (m + 1)$, the 1 in the k-th position 
in the canonical decomposition of $S_n - m$ changes to a zero and, in order to obtain 
the canonical decomposition of $S_n - (m + 1)$, we need to know the canonical 
decomposition of $S_k - 1$, which has been obtained in the above lemma. Thus we 
see that while going from the m-th to the (m + 1)-th row, the P (above) changes to 
a G and we have to insert, from right to left, as many Ps as possible followed by 
DG, or G or DL or L or D or none of them as the case may be. The symbols 
to the right of P in the m-th row remain unchanged in the (m + 1)-th row. This 
is precisely the rule given by Sarngadeva for going from a given row to the next 
in the prastara, when the first non-druta element in the row (going from left to 
right) happens to be P. If the first non-druta element in the m-th row of a prastara 
happens to be G or L, we can use a similar argument to show that in the next row 
this symbol is changed to L or D respectively, and the rest of the symbols of the 
row on the left are obtained exactly as given by the rule of Sarngadeva, and those 
on the right remain the same. 

Let us illustrate this by an example. The 15-th row (from the top) of the 
7-druta prastara with 33 rows (see Table 6) is PD and corresponds to the canonical 
decomposition 

$$S_7 - 15 = 33 - 15 = 18 = 2 + 6 + 10 = 0 \cdot S_1 + 1 \cdot S_2 + 0 \cdot S_3 + 1 \cdot S_4 + 1 \cdot S_5 + 0 \cdot S_6 + 0 \cdot S_7$$

This corresponds to the 7-tuple (0, 1, 0, 1, 1, 0, 0). The next row is associated with 
the canonical decomposition 

$$S_7 - 16 = 33 - 16 = 17 = 1 + 6 + 10$$
$$= (S_2 - 1) + 0 \cdot S_3 + 1 \cdot S_4 + 1 \cdot S_5 + 0 \cdot S_6 + 0 \cdot S_7$$
$$= 1 \cdot S_1 + 0 \cdot S_2 + 0 \cdot S_3 + 1 \cdot S_4 + 1 \cdot S_5 + 0 \cdot S_6 + 0 \cdot S_7$$

This corresponds to the 7-tuple (1, 0, 0, 1, 1, 0, 0) and represents the row LGD 
(see Table 6). That is, P is replaced by LG and the symbol D to the right of P 
remains unchanged. 

¹⁴ We may note that, in the case of matra-vrttas, the above lemma, which gives the first row of a 
prastara of value n, translates itself into the following simple identities between Virahanka-Fibonacci 
numbers (see the article of R. Sridharan, cited earlier, p. 133): 

$$s_1 + s_3 + \cdots + s_{2k-1} = s_{2k} - 1 \qquad (\text{if } n = 2k)$$
$$s_2 + s_4 + \cdots + s_{2k} = s_{2k+1} - 1 \qquad (\text{if } n = 2k + 1)$$

In Table 17, we have listed the 33 binary sequences of length 7 that correspond 
to the rows of the 7-druta-prastara. We note that the last row, whose entries are all 
0, corresponds to the canonical decomposition of 0. The row above it, (1, 0, ..., 0), 
is got by increasing the first zero of the last row by 1. This corresponds to replacing 
$S_1 - 1 = 0$ by $S_1 = 1$. In fact, in passing from the m-th row to the (m + 1)-th 
row from below, a certain zero entry becomes 1, the entries to its left become 0 
and the entries to its right are unchanged. This, indeed, corresponds 
to replacing the canonical decomposition of $S_k - 1$ for some k < n (in an n-druta 
prastara), by $S_k$. 

For instance, in Table 17 (for n = 7), in the passage from the 27th row 
(counted from below), which corresponds to the binary sequence (0, 1, 0, 1, 0, 1, 0), 
to the 28th, the sequence in question gets replaced by (0, 0, 1, 1, 0, 1, 0), which 
corresponds to the replacement of $S_3 - 1$ by $S_3$. In the same way, in passing 
from the 15th row (counted from below), which corresponds to the binary 
sequence (0, 1, 1, 0, 1, 0, 0), to the 16th, the above sequence gets replaced by 
(0, 0, 0, 1, 1, 0, 0), which corresponds in turn to the replacement of $S_4 - 1$ by 
$S_4$. 
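
The passage from a row number to a tala-form through the canonical decomposition can be sketched as follows. We assume here that the canonical decomposition is obtained by always subtracting the largest samkhyanka that fits, which agrees with every decomposition written out in this Appendix; the function names are ours, and $S_0 = 1$ is used only as a convenient starting value for the recurrence.

```python
def samkhyankas(n):
    """Return [S_1, ..., S_n], using S_n = S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}."""
    s = {0: 1}    # assumed starting value, standing for the empty form
    for i in range(1, n + 1):
        s[i] = sum(s.get(i - d, 0) for d in (1, 2, 4, 6))
    return [s[i] for i in range(1, n + 1)]


def canonical_bits(value, n):
    """Binary n-tuple of the decomposition of value (< S_n), largest samkhyanka first."""
    s = samkhyankas(n)
    bits = [0] * n
    for k in range(n - 1, -1, -1):     # s[k] is S_{k+1}
        if s[k] <= value:
            bits[k] = 1
            value -= s[k]
    return bits


def decode(bits):
    """Read a binary sequence as talangas: P = 010110, G = 0110, L = 10, D = 0."""
    text = "".join(str(b) for b in bits)
    form = ""
    while text:
        for code, symbol in (("010110", "P"), ("0110", "G"), ("10", "L"), ("0", "D")):
            if text.startswith(code):
                form += symbol
                text = text[len(code):]
                break
        else:
            raise ValueError("sequence does not represent a tala-form")
    return form


def row(m, n):
    """m-th row (from the top) of the n-druta prastara, via the decomposition of S_n - m."""
    return decode(canonical_bits(samkhyankas(n)[-1] - m, n))


if __name__ == "__main__":
    print([row(1, n) for n in range(1, 8)])   # first rows, as in the lemma
    print(row(15, 7), row(16, 7))
```

Calling `row(1, n)` for successive n reproduces the pattern of the lemma: a block of Ps on the right with a short remainder on the left.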

It is interesting to note that the process is analogous to the rule for going from 
one row to the next in the case of tana prastara, where it is discussed in terms of 
the factorial representation (see Proposition 2, Sec. 2). 

We shall now indicate a method of constructing the binary sequences corre- 
sponding to the rows of the prastara of value n, assuming that such a description 
is possible for the prastara of value n — 1. To do this, we first show how the n- 
druta prastara itself can be obtained through the knowledge of the (n — 1)-druta 
prastara. 

TABLE 17. Canonical decomposition of rows of Seven-druta-prastara 


As we have remarked earlier, the prastara of value n is obtained by first writing 
down the prastara of value n - 6, augmented by P, adding below this the 
prastara of value n - 4 augmented by G, adding below this the prastara of value 
n - 2 augmented by L and finally adding below this the prastara of value n - 1 
augmented by D. We will see how the prastaras of values n - 2, n - 4 and n - 6 
sit inside the prastara of value n - 1. If we take the $S_{n-2}$ rows of the prastara of 
value n - 1 ending with D and omit the last D, we obtain the prastara of value 
n - 2. If we take the $S_{n-4}$ rows of the prastara of value n - 1 ending with DL 
and omit the last two entries DL from the rows, we obtain the prastara of value 
n - 4. This is because, if we take the rows of the prastara of value n - 1 ending 
with L and omit the last entry L, we obtain the prastara of value n - 3; now, if we 
take the rows of the prastara of value n - 3 ending with a D and omit this D, we 
obtain the prastara of value n - 4. It follows by a similar argument that if we take 
the $S_{n-6}$ rows of the prastara of value n - 1 ending with DLL and delete these 
three entries from the rows, we obtain the prastara of value n - 6. From the above 
remarks, we deduce the following: 


Proposition: The prastara of value n is obtained from the prastara of value 
n - 1 as follows: 


(i) We take all the rows of the prastara of value n - 1 and augment these by a 
D at the end. These form the last $S_{n-1}$ rows of the prastara of value n. 


(ii) We take the rows of the prastara of value n - 1 ending with a D and replace 
this D by an L. These form the next $S_{n-2}$ rows of the prastara of value n 
occurring above the rows constructed in (i). 


(iii) We take the rows of the prastara of value n - 1 ending with DL and replace 
the last two entries DL of these rows by a G. These form the next $S_{n-4}$ rows 
of the prastara of value n occurring above the rows constructed in (ii). 


(iv) Finally, we take the rows of the prastara of value n - 1 ending with DLL 
and replace the last three entries DLL of these rows by a P. We then obtain 
the first $S_{n-6}$ rows of the prastara of value n. 
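
The four steps of the Proposition translate almost word for word into code. The following is a minimal sketch; the function name `prastara` and the representation of a row as a string over D, L, G, P are our own, and the prastara of value 0 is taken to consist of the single empty row.

```python
def prastara(n):
    """Rows (top to bottom) of the n-druta prastara, built from the prastara of value n - 1."""
    if n == 0:
        return [""]          # assumed convention: one empty row
    prev = prastara(n - 1)
    rows = []
    # Steps (iv), (iii), (ii): replace a final DLL, DL or D by P, G or L respectively.
    for tail, symbol in (("DLL", "P"), ("DL", "G"), ("D", "L")):
        rows += [r[:-len(tail)] + symbol for r in prev if r.endswith(tail)]
    # Step (i): augment every row by a D; these form the last rows.
    rows += [r + "D" for r in prev]
    return rows


if __name__ == "__main__":
    for number, form in enumerate(prastara(7), start=1):
        print(number, form)
```

The order of the rows within each group is inherited from the prastara of value n - 1, which is what makes the four groups line up with the rule of enumeration.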


In a similar vein, we can construct the binary sequences of length n associated with 
the prastara of value n, assuming that we have already constructed the binary 
sequences of length n - 1 associated with the prastara of value n - 1. To carry 
out this construction, we need to write down the canonical decomposition of any 
number less than $S_n$, assuming that we have written down the canonical 
decomposition of any number less than $S_{n-1}$. Now, any integer less than $S_n$ is either less 
than $S_{n-1}$ or of the form $S_{n-1} + i$ where $0 \le i < S_n - S_{n-1}$. Since $2S_{n-1} \ge S_n$, 
we have $i < S_{n-1}$. Therefore we can write down the canonical decomposition of 
any number less than $S_n$ (assuming that we have written down the canonical 
decomposition of any number less than $S_{n-1}$). This leads to the following procedure 
for the construction of the binary sequences of length n that we are looking for. 

In order to write down the binary sequences of length n associated with the 
prastara of value n: 


(i) We take all the binary sequences of length n - 1 corresponding to the prastara 
of value n - 1 and augment these sequences on the right by 0. These form 
the binary sequences corresponding to the last $S_{n-1}$ rows of the prastara of 
value n. This amounts to augmenting each of the rows of the prastara of 
value n - 1 by D. 


(ii) The binary sequences associated with the first $S_n - S_{n-1}$ rows of the prastara 
of value n are obtained as follows: We take the binary sequences associated 
with the last $S_n - S_{n-1}$ rows of the prastara of value n - 1 and replace 
the last 0 of these rows by the string 10. We can verify that this procedure 
amounts to taking the rows of the prastara of value n - 1 ending with a D 
and replacing this D by an L, taking the rows of the prastara ending with 
DL and replacing the DL by G and finally taking the rows of the prastara 
ending with DLL and replacing the DLL with a P. 


TABLE 18. Unmeru for Tala Prastara 


Remark: It is interesting to see what happens if we apply the procedure 
outlined in (ii) to all the binary sequences associated with the prastara of value n - 1. 
For example, if n = 8, $S_8 - S_7 = 60 - 33 = 27$. We have already seen in (ii) 
what happens if we take the last 27 rows of the 7-druta prastara and replace the 
last 0 of these sequences with the string 10. Now, if we take the binary strings 
associated with the first six rows of the 7-druta prastara and replace the last 0 of these 
sequences with the string 10, then we see that in each case one of the conditions 
(a)-(e) (mentioned in Proposition 3 of Sec. 3) will be violated. 
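
Steps (i) and (ii), together with the Remark, can be tried out with a short script. This is a sketch under two assumptions of ours: the samkhyankas are generated from the starting value $S_0 = 1$, and a binary sequence is treated as admissible exactly when it splits into the codes 010110, 0110, 10 and 0, which we use as a stand-in for the conditions (a)-(e) that are not restated here.

```python
def samkhyanka(n):
    """S_n from the recurrence S_n = S_{n-1} + S_{n-2} + S_{n-4} + S_{n-6}."""
    s = {0: 1}    # assumed starting value
    for i in range(1, n + 1):
        s[i] = sum(s.get(i - d, 0) for d in (1, 2, 4, 6))
    return s[n]


def binary_rows(n):
    """Binary sequences (top to bottom) for the rows of the n-druta prastara."""
    if n == 1:
        return ["0"]
    prev = binary_rows(n - 1)
    extra = samkhyanka(n) - samkhyanka(n - 1)
    # Step (ii): in the last S_n - S_{n-1} sequences, replace the final 0 by 10.
    top = [seq[:-1] + "10" for seq in prev[len(prev) - extra:]]
    # Step (i): augment every sequence on the right by 0.
    bottom = [seq + "0" for seq in prev]
    return top + bottom


def is_admissible(seq):
    """True if seq splits into the codes P = 010110, G = 0110, L = 10, D = 0."""
    while seq:
        for code in ("010110", "0110", "10", "0"):
            if seq.startswith(code):
                seq = seq[len(code):]
                break
        else:
            return False
    return True


if __name__ == "__main__":
    rows7 = binary_rows(7)
    extended = [seq[:-1] + "10" for seq in rows7]
    for number, seq in enumerate(extended, start=1):
        print(number, seq, is_admissible(seq))
```

In the printed list for n = 8 the sequences obtained from the first six rows of the 7-druta prastara are the ones rejected, in line with the Remark.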


Appendix B. Narayana's unmeru method for nasta and uddista 


Narayana Pandita in his Ganita-kaumudi (c. 1356) discusses the nasta and 
uddista processes for his generalization of the matra-vrtta-prastara. For this 
purpose, he introduces what he refers to as the unmeru. Though Narayana Pandita's 
generalization of the matra-vrtta-prastara does not subsume the tala-prastara as 
considered in Sangitaratnakara, his unmeru method can be suitably altered so as 
to give an alternative method for nasta and uddista. 

We shall illustrate the modified unmeru method for nasta and uddista by 
considering the example of the 7-druta prastara. To construct the unmeru, in 
the bottom row place the number 1, followed by the samkhyankas up to 33, 
the samkhyanka of the given prastara. In the columns above the samkhyankas, 
place the integers 1, 2, 3, 4, etc., starting with one entry above the penultimate 
samkhyanka 19, with the number of entries in each subsequent column increasing 
by one, as we move from right to left. Then, remove all entries in the columns 
above the samkhyankas, except the values 1, 2, 4, 6 corresponding to our four 
talangas. Thus the unmeru is formed as in Table 18. 


B.1. Nasta-vidhi 


Given any number less than or equal to 33, find the set of patita and apatita 
samkhyankas as usual. Starting from the right, note the entry in the first row 
above the first apatita (if there is no entry go to the next apatita along the same 
row). In the row above the top most entry of the column of that apatita, move left 
till you reach the column of the next apatita. Note the corresponding entry (or 
move on in the same row till the next apatita, if there is no entry). And so on. 


Ex. 1: To find the 8th tala-form in the 7-druta prastara. 


The apatita are 33, 10, 3, 2, 1, 1. We get the tala-form 11122 or DDDLL. 


Ex. 2: To find the 15th tala-form in the 7-druta prastara. 


33, 19, 3, 1, 1 are apatita. We get the tala-form 61 or PD. 


Ex. 3: To find the 28th tala-form in the 7-druta prastara. 


33, 19, 10, 6, 1, 1 are apatita. We get the tala-form 4111 or GDDD. 


Ex. 4: To find the 2nd tala-form in the 7-druta prastara. 


33, 6, 3, 1, 1 are apatita. We get the tala-form 124 or DLG. 

Thus, we see that the nasta-process becomes somewhat simpler when we make 
use of the unmeru. Since the uddista-process is quite simple in the standard 
formulation itself, there is no need to discuss the process in terms of the unmeru. 
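
The lists of apatita numbers in the four examples can be reproduced with a few lines of code. The sketch assumes that the patita numbers are found "as usual" by trying to subtract the samkhyankas from the largest downwards, and that the extra 1 at the start of the bottom row of the unmeru is always apatita; both assumptions agree with Ex. 1 to Ex. 4, and the function name `patita_apatita` is ours.

```python
def patita_apatita(m, n):
    """Patita and apatita numbers for the m-th row of the n-druta prastara."""
    s = {0: 1}    # the leading 1 of the unmeru's bottom row
    for i in range(1, n + 1):
        s[i] = sum(s.get(i - d, 0) for d in (1, 2, 4, 6))
    remainder = s[n] - m
    patita, apatita = [], [s[n]]          # S_n itself is never subtracted
    for i in range(n - 1, 0, -1):         # try S_{n-1}, ..., S_1 in turn
        if s[i] <= remainder:
            patita.append(s[i])
            remainder -= s[i]
        else:
            apatita.append(s[i])
    apatita.append(s[0])                  # assumed: the leading 1 is always apatita
    return patita, apatita


if __name__ == "__main__":
    for m in (8, 15, 28, 2):
        print(m, patita_apatita(m, 7))
```

The patita numbers returned for a row always add up to the samkhyanka of the prastara minus the row number.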


Appendix C. Druta-meru proposed by Akella Mallikarjuna Sarma 


The druta-meru described in the Sangitaratnakara and displayed explicitly in 
commentaries does not render transparent the recursion relation 


$$D^n_k = D^{n-1}_{k-1} + D^{n-2}_k + D^{n-4}_k + D^{n-6}_k$$

satisfied by $D^n_k$ (the number of tala-forms with k drutas in the n-druta-prastara). 
As is clear by its construction, the successive columns of the druta-meru correspond 
to 1-druta, 2-druta etc., prastaras. But, as we noted earlier, the successive 
rows have a different interpretation, depending on whether we are considering odd 
or even columns. The row above the bottom row alternately gives the number of 
tala-forms with 1 druta and 0 druta; the next row alternately gives the number 
of tala-forms with 3 drutas and 2 drutas, and so on. 

In other words, the druta-meru proposed in Sangitaratnakara is in some sense 
obtained by fusing together successive rows of a more spread-out druta-meru 
where each row corresponds to a fixed number of drutas. This spread-out 
druta-meru will have a whole lot of 0 entries, as there are no tala-forms with an even 
number of drutas in an n-druta prastara where n is odd and vice versa. Such a 
spread-out druta-meru has been proposed by Akella Mallikarjuna Sarma (see for 
instance, his book Sangitaratnakara: A Critical Interpretation, Hyderabad 2001, 
p. 26), and is displayed in Table 20. The relation 


$$D^n_k = D^{n-1}_{k-1} + D^{n-2}_k + D^{n-4}_k + D^{n-6}_k$$

can easily be seen to be at the basis of the spread-out druta-meru (see Table 20), 
as the indices n, k now correspond to the columns and rows respectively in a 
one-to-one manner. 


TABLE 19. Druta-meru of Sangitaratnakara. 


The different columns correspond to prastaras of different values which 
are indicated in the bottom row. The entries in the successive rows above 
the bottom row in the n-th column give the number of tala-forms which 
have 0, 2, 4, etc., drutas respectively when n is even, and the number of 
tala-forms which have 1, 3, 5, etc., drutas respectively when n is odd. 


                                       1    0    8    0   44    0  200
                                  1    0    7    0   35    0  147    0
                             1    0    6    0   27    0  104    0  357
                        1    0    5    0   20    0   70    0  220    0
                   1    0    4    0   14    0   44    0  125    0  336
              1    0    3    0    9    0   25    0   63    0  153    0
         1    0    2    0    5    0   12    0   26    0   56    0  118
    1    0    1    0    2    0    4    0    7    0   13    0   24    0
    --------------------------------------------------------------------
    0    1    2    3    4    5    6    7    8    9   10   11   12   13

TABLE 20. Alternative form of Druta-meru due to Akella 
Mallikarjuna Sarma. 


The different columns correspond to prastaras of different values which 
are indicated in the bottom row. The entries in the successive rows above 
the bottom row in the n-th column give the number of tala-forms which 
have 0, 1, 2 etc., drutas respectively. 


What's so Baffling About Negative 
Numbers? — a Cross-Cultural 
Comparison 


David Mumford 


I was flabbergasted when I first read Augustus De Morgan's writings about negative 
numbers¹. For example, in the Penny Cyclopedia of 1843, to which he contributed 
many articles, he wrote in the article Negative and Impossible Quantities: 


It is not our intention to follow the earlier algebraists through their different uses 
of negative numbers. These creations of algebra retained their existence, in the 
face of the obvious deficiency of rational explanation which characterized every 
attempt at their theory. 


In fact, he spent much of his life, first showing how equations with these meaningless 
negative numbers could be reworked so as to assert honest facts involving only 
positive numbers and, later, working slowly towards a definition of abstract rings 
and fields, the ideas which he felt were the only way to build a fully satisfactory 
theory of negative numbers. 

On the other hand, every school child today is taught in fourth and fifth grade 
about negative numbers and how to do arithmetic with them. Somehow, the aversion 
to these ‘irrational creations’ has evaporated. Today they are an indispensable part 
of our education and technology. Is this an example of our civilization advancing 
since 1843, our standing today on the shoulders of giants and incorporating their 
insights? Is it reasonable, for example, that calculus was being developed and the 
foundations of physics being laid — before negative numbers became part of our 
numerical language!? 

The purpose of this article is not to criticize specific mathematicians but first 
to examine from a cross-cultural perspective whether this same order of discovery, 
the late incorporation of negatives into the number system, was followed in 
non-Western cultures. Then secondly, I want to look at some of the main figures in 
Western mathematics from the late Middle Ages to the Enlightenment and examine 
to what extent they engaged with negative numbers. De Morgan was not an isolated 
figure but represents only the last in a long line of great mathematicians in the West 
who, from a modern perspective, shunned negatives. Thirdly, I want to offer some 
explanation of why such an air of mystery continued, at least in some quarters, to 
shroud negative numbers until the mid-19th century. There are several surveys of 
similar material² but, other than describing well this evolution, these authors seem 
to accept it as inevitable. On the contrary, I would like to propose that the late 
acceptance of negative numbers in the West was a strange corollary of two facts 
which were special to the Western context, which I will describe in the last section. I 
am basically a Platonist in believing that there is a single book of mathematical truths 
that various cultures discover as time goes on. But rather than viewing the History 
of Mathematics as the unrolling of one God-given linear scroll of mathematical 
results, it seems to me this book of mathematics can be read in many orders. In the 
long process of reading, accidents particular to different cultures can result in gaps, 
areas of math that remain unexplored until well past the time when they would have 
been first relevant. I would suggest that the story of negative numbers is a prime 
example of this effect. 


Figure 1. Augustus De Morgan 


¹ De Morgan's attitudes are, of course, well known to historians of Mathematics. But my naïve idea 
as a research mathematician had been that at least from the time of Newton and the Enlightenment an 
essentially modern idea of real numbers was accepted by all research mathematicians. 

² Three references are (i) Jacques Sesiano, The Appearance of Negative Solutions in Medieval 
Mathematics, Archive for History of the Exact Sciences, vol. 32, pp. 105-150; (ii) Helena Pycior, 
Symbols, Impossible Numbers and Geometric Entanglements, Cambridge Univ. Press, 1997; (iii) Gert 
Schubring, Conflicts between Generalization, Rigor and Intuition, Springer 2005. 

This paper started from work at a seminar at Brown University but was developed 
extensively at the seminar on the History of Mathematics at the Chennai Mathemat- 
ical Institute whose papers appear in this volume. I want to thank Professors P. P. 
Divakaran, K. Ramasubramanian, C. S. Seshadri, R. Sridharan and M. D. Srinivas 
for valuable conversations and tireless efforts in putting this seminar together. On 
the US side, I especially want to thank Professor Kim Plofker for a great deal of 
help in penetrating the Indian material, Professor Jayant Shah for his help with both 
translations and understanding of the Indian astronomy and Professor Barry Mazur 
for discussions of Cardano and the discovery of complex numbers. I will begin 
with a discussion of the different perspectives from which negative numbers and 
their arithmetic can be understood. Such an analysis is essential if we are to look 
critically at what early authors said about them and did with them. 


1. The Basis of Negative Numbers and Their Arithmetic 


It is hard, after a contemporary education, to go back in time to your childhood 
and realize why negative numbers were a difficult concept to learn. This makes it 
doubly hard to read historical documents and see why very intelligent people in the 
past had such trouble dealing with negative numbers. Here is a short preview to try 
to clarify some of the foundational issues. 

Quantities in nature, things we can measure, come in two varieties: those which, 
by their nature, are always positive and those which can be zero or negative as well 
as positive, which therefore come in two forms, one canceling the other. When one 
reads in mathematical works of the past that the writer discards a negative solution, 
one should bear in mind that this may simply reflect that for the type of variable in 
that specific problem, negatives make no sense and not conclude that that author 
believed all negative numbers were meaningless⁴. Table I gives examples of both kinds. The first five are 
ingredients of Euclidean mathematics and the sixth occurs in Euclid (the unsigned 
case) and Ptolemy (the signed case, labeled as north and south) respectively. 

What arithmetic operations can you perform on these quantities? If they are 
unsigned, then, as in Euclid, we get the usual four operations: 


1. a + b OK 

2. a - b but only if a > b (as De Morgan insisted so strenuously) 

3. a · b OK but units of the result are different from those of the arguments, 
e.g. length × length = area, length × length × length = volume 

4. a/b OK but again units are different, 
e.g. length / length = pure number, area / length = length 


³I believe the discovery of Calculus and, especially, simple harmonic motion, the differential 
equations of sine and cosine, in India and the West provide a second example. 

⁴For example, Bhaskara II has a problem in which you must solve for the number of monkeys in 
some situation, and obviously this cannot be negative. 


TABLE I 

Modern units             | Naturally Positive Quantities               | Signed Quantities 
positive integer         | # of people/monkeys/                        | 
positive real            | proportion of 2 lengths (Euclid, Bk V)      | 
meters                   | length of movable rigid bar/stick           | 
meters²                  | area of movable rigid flat object           | 
meters³                  | volume of movable rigid object or           | 
                         | incompressible [fluid]                      | 
                         | measure of a plane angle                    | distance N/S of equator 
dollars                  |                                             | fortune/debt; profit/loss; asset/liability 
m                        |                                             | (a) distance on line/road, rel. to fixed pt, the 'number line' 
                         |                                             | (b) also, height above/below the surface of earth 
seconds                  |                                             | time before or after the present or relative to a fixed event 
                         |                                             | velocity on a line, forwards or backwards 
degrees (of temperature) | Kelvin temperature                          | Fahrenheit or Celsius temperature 
                         | mass or weight of an object                 | your weight on a scale = force of gravity on your body (a vector) 


If they are signed quantities, addition and subtraction are relatively easy — but 
modern notation obscures how tricky it is to define the actual operation in all cases! 
TABLE II 

First summand | Second summand | Sum                     | Difference 
a             | b              | usual a + b             | a - b if a > b 
              |                |                         | (neg)(b - a) if b > a 
(neg)a        | (neg)b         | (neg)(a + b)            | b - a if b > a 
              |                |                         | (neg)(a - b) if a > b 
a             | (neg)b         | a - b if a > b          | a + b 
              |                | (neg)(b - a) if b > a   | 
(neg)a        | b              | b - a if b > a          | (neg)(a + b) 
              |                | (neg)(a - b) if a > b   | 


We write the simple expression a - b, and consider it obviously the same as any of 
these: 

a + (-b) = a - (+b) = a + (-1)·b 


but each is, in fact, a different expression with a different meaning. Given an ordinary 
positive number a, -a is naturally defined as the result of subtracting a from 0. For 
a minute, to fix ideas, don't write -a, but use the notation (neg)a for 0 - a. Then 
note how complicated it is to define a + b for all signs of a and b. Starting with a 
and b positive, Table II gives the sums and differences of a and (neg)a with b and 
(neg)b. 

Understanding this table for the case of addition seems to be the first step in 
understanding and formalizing negatives. The second step is to extend subtraction 
to negatives so as to get the last column. This is contained in the rule: 


a - (-b) = a + b, for all positive numbers a, b. 


The basic reason for this is that we want the identity a - x + x = a to hold for all x, 
positive or negative or, in other words, subtraction should always cancel out addition. 
If we take x equal to -b, then replacing a - (-b) by a + b makes this identity 
hold. The argument one finds in some historical writings may be paraphrased as 
"taking away a debt of size x is the same as acquiring a new asset of size x", a fact 
obvious to any merchant. In any case, understanding of negatives up to this point 
seems to be a natural stage that one encounters in various historical documents. In 
modern terminology, while acknowledging that our modern words distort historical 
truth, one would paraphrase this stage by saying that it incorporates the idea that 
the integers, positive and negative, are an abelian group under addition. 
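
The case analysis of Table II can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names Signed, add, sub and to_int are hypothetical, a quantity is stored as an unsigned magnitude plus a (neg) flag, and only unsigned addition and larger-minus-smaller subtraction are used on magnitudes. The case of equal and opposite quantities, which the table leaves implicit, is taken to give zero.

```python
from typing import NamedTuple


class Signed(NamedTuple):
    neg: bool   # True stands for (neg)mag, False for plain mag
    mag: int    # unsigned magnitude, never negative


def add(x: Signed, y: Signed) -> Signed:
    """Sum column of Table II, using only unsigned + and larger-minus-smaller."""
    if x.neg == y.neg:                      # rows 1 and 2: same kind, add magnitudes
        return Signed(x.neg, x.mag + y.mag)
    if x.mag > y.mag:                       # rows 3 and 4: the larger magnitude wins
        return Signed(x.neg, x.mag - y.mag)
    if y.mag > x.mag:
        return Signed(y.neg, y.mag - x.mag)
    return Signed(False, 0)                 # equal and opposite quantities cancel


def sub(x: Signed, y: Signed) -> Signed:
    """Difference column: subtracting y is adding its opposite, so a - (neg)b = a + b."""
    return add(x, Signed(not y.neg, y.mag))


def to_int(x: Signed) -> int:
    """Convert to a modern signed integer, used only to check the table."""
    return -x.mag if x.neg else x.mag


print(sub(Signed(False, 3), Signed(True, 4)))   # Signed(neg=False, mag=7)
```

Writing the difference as "add the opposite" is a modern shortcut; the table itself spells out each case separately.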

But multiplication of negatives is a subtler operation, the third and final step in 
the arithmetic of negatives. Modern notation again obscures the subtlety. When you 
write the simple identity -a = (-1)·a, you are making a big step. Perhaps this is a 
contemporary mathematician splitting hairs because historically this seems to have 
been assumed as completely natural by nearly every mathematician once they knew 
the rules for subtracting negative numbers (with the exception perhaps of Cardano 
and Harriot, see below). One difficulty in arguing for this rule is that there are not 
many simple cases of quantities in the world where the units of the two multiplicands 
allow us to infer the multiplication rule using our physical intuition about the world. 
Here are a number of ways of arguing that the identity (-1)·(-1) = +1 must hold. 


Method I: Use the basic, intuitively obvious, identity: 
distance = velocity × time 
and argue that if you substitute: 


(a) velocity = movement of one meter backwards per second, a negative number, 
(b) time = 1 second in the past, also negative, 
(c) then one second ago, you were 1 meter ahead, i.e. distance = +1 meter. 


This 'proves' (-1)·(-1) = +1. 


Method I': I know of only one other real world situation where the rule is intuitively 
obvious. This variant of the previous argument concerns money and time. We use 
the simple equation, obvious to any merchant, describing the linear growth of a 
business's assets: 


assets at time t = (rate of change of assets) × (elapsed time t) + (assets at present) 


Now suppose a business is losing $10,000 a year and is going bankrupt right now. 
How much money did it have a year ago? Substitute t = -1, rate = -10000, present 
assets = 0 and the obvious fact that assets a year ago = +10000 to conclude that 
(-1)·(-10000) = +10000. 


Method II: (as in Euclid’s geometric algebra) 
In Euclid, multiplication occurs typically when the area of a rectangle is the 
product of the lengths of its two sides. Consider the diagram below: 
The big rectangle has area a·b but the shaded rectangle has area (a - c)·(b - d). 
Since the area of the shaded rectangle equals the area of the big rectangle minus the 
areas of the top rectangle and the left rectangle plus the area of the small top-left 
rectangle (which has been subtracted twice), we get the identity 


(a - c) × (b - d) = ab - bc - ad + cd, if a, b, c, d > 0, a > c, b > d 


Now we use the idea that identities should always be extended to more general 
situations so long as no contradiction arises. If we extend this principle to arbitrary 
a, b, c, d (which will bring in negative lengths and areas), we get for a = b = 0: 


(-c)(-d) = +cd 
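
One way to see this extension at work is the following minimal Python sketch. It assumes (this is not in the text) that a signed quantity is written as a pair (plus_part, minus_part) of non-negative integers standing for plus_part - minus_part, and it multiplies two such pairs using only the right-hand side of the rectangle identity, regrouped as (ab + cd) - (bc + ad). The names mul and value are hypothetical.

```python
def mul(p, q):
    """Multiply (a - c) by (b - d) using only products of non-negative numbers.

    p = (a, c) and q = (b, d) are pairs of non-negative integers; the result
    is the pair standing for (ab + cd) - (bc + ad).
    """
    a, c = p
    b, d = q
    return (a * b + c * d, b * c + a * d)


def value(p):
    """Modern signed value of a pair, used only for checking."""
    plus_part, minus_part = p
    return plus_part - minus_part


minus_one = (0, 1)                    # a = 0, c = 1, i.e. the quantity 0 - 1
print(mul(minus_one, minus_one))      # (1, 0), i.e. +1
```

No sign rule is built into mul; the rule for two negatives falls out of the identity once a and b are allowed to be zero.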


This approach is probably the most common way to derive the multiplication rule. It 
can be phrased purely algebraically if you extend the distributive law to all numbers 
and argue like this (using also 0·x = 0 and 1·x = x): 


1 = 1 + (-1)·0 = 1 + (-1)·(1 + (-1)) = 1 + (-1)·1 + (-1)·(-1) = (-1)·(-1). 


Method III: Start with the multiplication 
(positive integer n) × (any quantity a) = (more of this quantity na) 


(e.g. 4 x (quart of milk) = a gallon of milk), then by subdividing quantities as well 
as replicating them, you can define multiplication 


(positive rational) x (quantity a) 
and by continuity (as in Eudoxus), define 
(positive real) x (quantity a) 


What we are doing is interpreting multiplication of any quantity by a positive 
dimensionless real number as scaling it, making it bigger or smaller as the case may 
be. Now if the quantity involved is signed you find it very natural to interpret 
reversing its sign as scaling by -1, i.e. to make the further definition: 


(-1) × (quantity a) = (quantity -a) 


Now you have multiplication by any real number, positive or negative. In other 
words, the negative version of scaling is taking quantities to their opposites. 

The core of this argument is the algebraic fact that the endomorphisms of an 
abelian group form a ring and we are constructing multiplication out of addition as 
composition of endomorphisms. This makes the third approach arguably the most 
natural to a contemporary mathematician trained in the Bourbaki style. 
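
For integer multipliers, the idea of multiplication as composition of scalings can be sketched as follows. This is a small illustration, not a formal construction: scale and compose are hypothetical names, quantities are modelled as Python integers, and a scaling is built only from repeated addition and from taking the opposite (0 - q).

```python
def scale(n):
    """Return the map q -> n times q, built from repeated addition and sign reversal."""
    def apply(q):
        total = 0
        for _ in range(abs(n)):
            total = total + q                    # replicate the quantity |n| times
        return total if n >= 0 else 0 - total    # negative n: take the opposite
    return apply


def compose(f, g):
    """Composition of two scalings: first g, then f."""
    return lambda q: f(g(q))


twice_reversed = compose(scale(-1), scale(-1))
print(twice_reversed(7))   # 7
```

Reversing a quantity twice returns it unchanged, which is the statement (-1)·(-1) = +1 read as a composition of maps.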
2. Negatives in Chinese and Indian Mathematics 


We will discuss China first. The classic of Chinese mathematics is the Jiuzhang 
Suanshu (Nine Chapters on the Mathematical Art). Like Euclid, this is a com- 
pendium of the mathematical concepts and techniques which had been developed 
slowly from perhaps the Zhou (or Chou) dynasty (begins c.1000 BCE) through the 
Western Han dynasty (ending 9 CE). Unlike Euclid, it is a list of practical real world 
problems and algorithms for their solution, without any indication of proofs. Since 
then, the Nine Chapters had a long history of ups and downs, sometimes being 
required in civil service exams and sometimes being burned and nearly lost. Each 
time it was republished though, new commentaries were added, starting with those 
of the great mathematician Liu Hui in 263 CE and continuing through those in the 
English translation by Shen, Crossley and Lun⁵. Page numbers in our quotes are 
from this last edition. 

Starting some time in the first millennium BCE, arithmetic in China began to 
be carried out using counting rods, which were arranged in rows using a decimal 
place notation. When doing calculations, different numbers were laid out by rods 
in a series of rows, forming a grid: a Japanese illustration of how they were used is 
shown in the figure below. 


Figure 2. A Japanese illustration of calculation with counting rods 


The section of the Nine Chapters in which negative numbers are introduced 
and used extensively is Chapter 8, Rectangular Arrays. This Chapter deals with 
the solutions of systems of linear equations and expounds what is, to all intents 
and purposes, the method of Gaussian Elimination. In fact, it is indistinguishable 
from the modern form. The coefficients are written out in a rectangular array of 
rod numerals and one adds and subtracts multiples of one equation from another 
equation until the system has triangular form. Examples as large as five equations in 
five unknowns are worked. Naturally negative numbers appear all the time in such 
an algorithm. 

⁵The Nine Chapters on the Mathematical Art: Companion and Commentary, Shen Kangshen, John 
N. Crossley, and Anthony W.-C. Lun, Oxford University Press, 1999. 

As described in Liu's commentary, red rods or upright rods were used for positive 
numbers which he calls gains (zheng) and black rods or slanting rods for negative 
numbers which he calls losses (fu). He says "red and black counting rods are used 
to cancel each other". Curiously, his colors are the exact opposite of our Western 
accounting convention! Here is Problem 8 from this Chapter, p.409 in the Shen, 
Crossley and Lun edition: 


Now sell 2 cows and 5 sheep, to buy 13 pigs. Surplus: 1000 cash. Sell 3 cows and 3 pigs to 
buy 9 sheep. There is exactly enough cash. Sell 6 sheep and 8 pigs. Then buy 5 cows. There is 
600 coins deficit. Tell: what is the price of a cow, a sheep and a pig respectively? 


This means the three equations (all of which have negative coefficients as well as 
positive): 


2C + 5S - 13P = 1000 
3C - 9S + 3P = 0 
-5C + 6S + 8P = -600 


The solution is found to be C = 1200, S = 500, P = 300. The Nine Chapters goes 
on rather mysteriously (p.404): 


Method: Using rectangular arrays lay down counting rods for each entry to be added. 

The Sign Rule 
Like signs subtract; opposite signs add; positive without extra, make negative; negative 
without extra, make positive. 
Opposite signs subtract; same signs add; positive without extra, make positive; negative 
without extra, make negative. 


Liu's commentary explains: the first set of sign rules refers to subtraction of array 
entries, the second to addition. He goes on to clarify the meaning of the cryptic 
Sign Rule. In fact, the rule is precisely what we wrote out in Table II above for both 
addition and subtraction. What is clear is that negative numbers were analyzed and 
treated correctly as soon as the need arose, presumably for the first time anywhere 
in the world. 
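
As a modern cross-check of Problem 8, here is a short Python sketch that eliminates on the augmented array of coefficients with exact fractions. It is not a reconstruction of the rod-numeral procedure: the function name solve is hypothetical, and the code reduces every column fully (Gauss-Jordan) rather than stopping at triangular form and back-substituting.

```python
from fractions import Fraction


def solve(rows):
    """Gauss-Jordan elimination on an augmented matrix, using exact fractions."""
    m = [[Fraction(v) for v in row] for row in rows]
    n = len(m)
    for col in range(n):
        # find a row at or below the diagonal with a non-zero entry in this column
        pivot = next(r for r in range(col, n) if m[r][col] != 0)
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col and m[r][col] != 0:
                factor = m[r][col] / m[col][col]
                # subtract a multiple of the pivot row; negative entries arise freely
                m[r] = [x - factor * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


# columns: cows, sheep, pigs, right-hand side
system = [
    [ 2,  5, -13, 1000],
    [ 3, -9,   3,    0],
    [-5,  6,   8, -600],
]
cow, sheep, pig = solve(system)
print(cow, sheep, pig)   # 1200 500 300
```

Even though all three prices are positive, the intermediate rows contain negative entries, which is the point made in the text.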

I cannot find in the Shen et al edition of the Nine Chapters any treatment of 
multiplication of negative numbers, although Martzloff quotes the Chinese edition 
of Qian Baocong as saying: "Rods of the same name multiplied by each other make 
positive. Rods of different names multiplied by each other make negative"⁶. In any 
case, Liu's commentary, written in the 3rd century CE, makes the remark (p.405): 
"Interchanging the red and black rods in any column is immaterial. So one can 
make the first entries of opposite sign." This is the correct rule for multiplication 
by -1. 

⁶Jean-Claude Martzloff, A History of Chinese Mathematics, 2nd edition, Springer, 1997, page 203. 

Chinese algebra had a renaissance in the Song and Yuan (Mongol) dynasties. In 
particular, Zhu Shijie (c.1260—c.1320) extended the ideas of Gaussian elimination 
to the simultaneous solution of polynomial equations, inventing the equivalent of 
the resultant and using ever larger and more complex arrays of coefficients. At 
this stage, as one would expect, the full rules for negative arithmetic emerge quite 
explicitly as well as those for the algebra of polynomials. Having a theory of negatives 
is the clear prerequisite for going further in the study of algebra. Zhu's algebra 
reached a stage not attained in Europe until the late 19th century. 

I want to turn to India next. In every culture, one of the main reasons for the 
development of arithmetic, arguably the principal driving force, is the need 
of merchants to keep accounts. In fact, it is even hypothesized that arithmetic and 
writing itself emerged in the 3rd millennium BCE in Mesopotamia as a development 
of a crude system of tracking transactions of agricultural goods by means of small 
specially shaped and inscribed tokens⁷. By around the year 2000 BCE, one finds 
tablets from Ur⁸ with a yearly summary accounting, showing budgeted and actual 
inputs (with value converted into a common unit of barley), budgeted and actual 
outputs, budgeted and actual labor and differences, shortfalls or profits! In India, very 
sophisticated principles of accounting were codified in Kautilya's comprehensive 
manual of statecraft, the Arthaśāstra, written in the 4th century BCE. The Arthaśāstra 
covers in amazing detail every aspect of setting up and managing of a kingdom 
(including managing a special forest for elephants). In Book II, Chapter 6 and also 
in many later Chapters of Book II, Kautilya details how accounts are to be kept⁹. 
He describes a complete system of book keeping: he has a ledger for income with 
dates, times, payers, categories, etc. and a ledger for expenditures and finally a 
third ledger for balances. There are sections on auditing, insurance against theft, 
debtors, borrowings, mortgages, etc. and subtler accounting issues such as 
current vs. deferred receipts, how to account for price changes of items in inventory, 
fixed vs. variable costs. Although he does not use negative numbers explicitly, he is 
clearly aware of how accounts must sometimes show a deficit and that people may 
carry a negative net worth. 

⁷The pioneer here has been Denise Schmandt-Besserat, who has brought her life's work together in 
the multi-volume book Before Writing, volume I being From Counting to Cuneiform, University of Texas 
Press, 1992. In particular, she has "deciphered" the mysterious tokens found throughout the Middle East 
from roughly 8000 BCE to 3000 BCE, finding a simple method of accounting which merges seamlessly 
with highly developed cuneiform accounts in the 3rd millennium. 

⁸See Chapter 5 in Richard Mattessich, The Beginnings of Accounting and Accounting Thought: 
Accounting Practice in the Middle East (8000 B.C. to 2000 B.C.) and Accounting Thought in India (300 
BCE to the Middle Ages), Garland Publishing, 2000. 

⁹See Chapter 6 in Mattessich, Op. Cit., which is based on the book Modern Accounting Concepts in 
Kautilya's Arthasastra by Anjan Bhattacharyya, Firma KLM, Calcutta, 1988. 

Although the Arthaśāstra does not mention negative numbers explicitly, they 
appear full blown in Brahmagupta's treatise Brāhma-sphuṭa-siddhānta (628 CE). 
The development of mathematics in India in the first millennium CE is connected 
much more strongly to astronomy than to accounting. For much of this period, 
treatises covering both mathematics (ganita) and astronomy (the motion of the sun, 
moon and planets and their positions at a given time and place in the sky) and called 
siddhantas were composed. Many of these were in verse, highly compressed and 
cryptic, meant to be memorized and handed down generation by generation from 
teacher to student. 

The Brāhma-sphuṭa-siddhānta includes two Chapters on mathematics which are 
compendia of the mathematical concepts and techniques which had been developed 
over previous centuries. Here we find all the correct rules for arithmetic with 
negative numbers and in it positive numbers are referred to as "fortunes", negative 
numbers as "debts". It appears that accounting led naturally to an arithmetic in 
which negative numbers took their natural place. Here are some quotations, showing 
first the rules we laid out in Table II and then, significantly, going on to describe 
how to multiply negative numbers¹⁰: 


[The sum] of two positives is positive, of two negatives, negative; of a positive and a negative 
[the sum] is their difference; if they are equal, it is zero. The sum of a negative and zero is 
negative, of a positive and zero positive, of two zeros, zero. 


[If] a smaller [positive] is to be subtracted from a larger positive, [the result] is positive; [if] 
a smaller negative from a larger negative, [the result] is negative; [if] a larger from a smaller, 
their difference is reversed — negative becomes positive and positive negative. 


The product of a negative and a positive is negative, of two negatives positive, and of positives 
positive; the product of zero and a negative, of zero and a positive, or of two zeros is zero. 


A positive divided by a positive and negative divided by a negative is positive; a zero divided 
by a zero is zero; a positive divided by a negative is negative; a negative divided by a positive 
is negative. 


Chapter 18, verses 30-34 


The only oddity seems to be his confident assertion that 0/0 = 0. The rest is 
as clear and modern as one could wish for. It would be wonderful to know what 
considerations led Indian mathematicians in the late centuries BCE or the early 
centuries CE to these conclusions, especially for the multiplication of negative 
numbers. The predominantly oral transmission of knowledge in the Vedic 
tradition, and perhaps the difficulty of preserving perishable writing materials 
through yearly monsoons, has not left us with any record of these discoveries. 
They just appear full blown in Brahmagupta's summary. R. Mattessich has developed 
at length the idea that it was the highly developed tradition of accounting which 
led to the full understanding of negative numbers¹¹ but unfortunately no evidence 
for this plausible conjecture exists. 

¹⁰We quote from the translation by Kim Plofker in her book, Mathematics in India, 500 BCE - 1800 
CE, Chapter 5, p.151. 

As in China, having negative numbers opened the way to deeper studies of 
algebra itself. Perhaps the deepest of these was the Indian work on Pell's equation 
x² - Ny² = m, especially finding solutions for m = 1. Brahmagupta himself made 
the first huge step, discovering the multiplication law arising from the factorization 


x² - Ny² = (x + √N y)(x - √N y) 


More exactly, he showed how from solutions of the equation for m₁, m₂, one gets 
one for their product m = m₁ · m₂. Some centuries later, Jayadeva found a complete 
algorithm for constructing solutions with m = 1. 
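
The multiplication law can be illustrated with a short Python sketch. It uses the formula obtained by multiplying out (x₁ + √N y₁)(x₂ + √N y₂), namely (x₁x₂ + N y₁y₂, x₁y₂ + x₂y₁), stated in modern notation rather than as a rendering of Brahmagupta's verse. The function names and the sample values N = 2 and N = 92 are chosen here for illustration only.

```python
def pell_value(x, y, n):
    """The quantity m = x^2 - n*y^2 attached to a pair (x, y)."""
    return x * x - n * y * y


def compose_solutions(s1, s2, n):
    """Combine solutions for m1 and m2 into a solution for m1 * m2."""
    (x1, y1), (x2, y2) = s1, s2
    return (x1 * x2 + n * y1 * y2, x1 * y2 + x2 * y1)


n = 2
s = (1, 1)                               # 1 - 2*1 = -1, so m = -1
t = compose_solutions(s, s, n)           # should solve the equation for m = (-1)*(-1) = 1
print(t, pell_value(*t, n))              # (3, 2) 1
```

The example with m = -1 shows the rule for the product of two negatives doing real work: two solutions for a negative m combine into a solution for m = 1.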

We find reflections of the Indian use of negatives in their astronomy too. As 
stated, the main goal of these scholars was not to develop mathematics for its own 
sake but to apply mathematics to predict the positions of the sun, moon and planets. 
An epicyclic theory is used and, for the planets, both a ‘slow’ and ‘fast’ correction 
is added to the mean motion of the planet (in our terms, one is due to the ellipticity 
of their orbit, the other to the shift from a heliocentric to a geocentric description). 
David Pingree¹² has hypothesized that through the intermediary of the Indo-Greek 
empire, some version of the pre-Ptolemaic Hipparchan theory of planetary motion 
reached India. What is quite striking is that in making these corrections the sine 
function in all four quadrants is understood. Hipparchus had computed tables of 
chords, which are fundamentally unsigned positive quantities. The Indian tradition 
shifts to sines (actually 'Rsines', sines multiplied by a large radius and rounded to the 
nearest integer) and then it is natural to extend them from the first quadrant to the 
full circle. Here is a quote from the Brāhma-sphuṭa-siddhānta, Chapter 2, verse 16 
describing the corrections made by adding or subtracting appropriate sine function 
corrections to the mean position: 


(In successive quadrants) (in the slow case) negative, positive, positive, negative correction, 
otherwise in the fast case. (The sum) of two positives (is) positive, of two negatives (is) 
negative; of positive and negative (is their) difference, of equals (positive and negative is) 
zero.¹³ 


¹¹See Chapter 7 in Mattessich, Op. Cit. 

¹²David Pingree, The History of Mathematical Astronomy in India, in Dictionary of Scientific 
Biography, Charles Gillispie editor, Scribner, 1978, pp.533-633. 

¹³Translation by J. Shah (personal communication). 

It would be nice if they had drawn a graph of the correction in all quadrants, i.e. of 
the sine function, to clarify this verse, but that was clearly not their modus operandi. 
But further evidence that the sine function was seen as being extended to more than 
one quadrant comes from the rational approximation of the sine in the first two 
quadrants given by Bhaskara I (7th century CE)¹⁴: 


sin(θ) ≈ 16θ(π - θ) / (5π² - 4θ(π - θ)),   0 ≤ θ ≤ π 


This is an extraordinarily accurate approximation which would be hard to come up 
with if they had not grouped the first and second quadrant together. 
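
The accuracy claim can be checked numerically. The following Python sketch evaluates the approximation sin(θ) ≈ 16θ(π - θ) / (5π² - 4θ(π - θ)) on a grid over the first two quadrants and compares it with the library sine. The function name bhaskara_sin and the grid of 1001 points are choices made here for illustration.

```python
import math


def bhaskara_sin(theta):
    """Rational approximation to sin(theta) for 0 <= theta <= pi (theta in radians)."""
    p = theta * (math.pi - theta)
    return 16 * p / (5 * math.pi ** 2 - 4 * p)


# largest absolute deviation from math.sin on an evenly spaced grid over [0, pi]
grid = [k * math.pi / 1000 for k in range(1001)]
max_error = max(abs(bhaskara_sin(t) - math.sin(t)) for t in grid)
print(max_error)
```

On this grid the largest deviation stays below 0.002, and the formula is exact at θ = π/6 and θ = π/2. Because it depends on θ only through the product θ(π - θ), it is automatically symmetric about π/2, which is what treating the first and second quadrants together amounts to.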

Another natural place for using negative numbers is for coordinates, e.g. to 
measure the celestial latitude (perpendicular to the ecliptic), or the declination (per- 
pendicular to the celestial equator), of a planet or star. Tradition, however, sanctifies 
describing latitudes and declinations as north/south instead of positive/negative and 
this is hard to change. But this latitude must often be put into formulas when convert- 
ing from celestial coordinates to horizon based coordinates, e.g. when calculating 
the very important rising times of planets. At this point, rules for negatives again must 
be used. Here is an example from Brahmagupta's Khaṇḍakhādyaka, Ch. 6, verse 5¹⁵: 


Multiply the north celestial latitude by the equinoctial shadow and divide by 12; apply the 
quotient taken as minutes negatively or positively to (the longitude measured from) the orient 
and occident ecliptic points. When the celestial latitude is south, apply the resulting minutes 
to the same points positively or negatively. 


In modern terms (see Figure 3), he is computing (longitude KA) + (latitude KV) 
× tan(φ), where φ is the observer's latitude and distinguishing the cases where 
longitude is measured eastwards or westwards and where the planet's latitude is 
north or south. 

An explicit interpretation of negatives as coordinates on a number line occurs 
later in the work of the 12th century Bhaskara II (so-called to distinguish him from 
the earlier 7th century Bhaskara I). He wrote an immensely popular textbook on 
Algebra, the Līlāvatī¹⁶. The title was said by a Persian translator to be the name of 
Bhaskara’s daughter and, although this is not made explicit in the book, it is full of 
verses addressed to the “beautiful one”, “the fawn-eyed one”, etc. Present day texts 
are so drab in comparison! 

Figure 3. Diagram for calculation of rising time. Planet at V, 
r K A ecliptic, P north celestial pole. 

¹⁴Bhaskara I, Mahabhaskariya, Ch. 7, verses 17-19. 

¹⁵The Khaṇḍakhādyaka of Brahmagupta, with the commentary of Bhattotpala, edited and translated 
by Bina Chatterjee, World Press, Calcutta, 1970, p.122-3. 

¹⁶We follow the classic translation by H. T. Colebrooke, first published in 1817 and subsequently 
reprinted in numerous editions. 

The remarkable passage is in verse 166 and again it is given without any fanfare 
stating that a new interpretation of negative numbers is being given. But, to my 
knowledge, it is the first occurrence of the "number line", of using positive and 
negative numbers as coordinates on either side of an origin. Bhaskara is in the 
middle of a discussion of triangles and, specifically, given the three sides a, b, c of 
the triangle with a distinguished side c, the base, how to find the altitude and the 
position of the foot of the perpendicular dropped on the base. If you let x be the 
distance from one endpoint of the base to the foot, then (c — x) is the distance from 
the other endpoint to the foot and Pythagoras's theorem tells us: 


a² - x² = altitude² = b² - (c - x)² 


which gives us: 
x = (a² + c² - b²)/2c 
In verse 166, he poses the problem: 


In a triangle, wherein the sides measure ten and seventeen and the base nine, tell me promptly, 
expert mathematician, the segments, perpendicular and area. 


and his formula gives him x = -6, c - x = 15 (see Figure 4). Aha: what to do? 
Well, if you draw this triangle, you find the foot of the perpendicular lies outside 
the base. So what does Bhaskara say? 


(The result 6) is negative, that is to say, in the contrary direction. Thus the two segments are 
found 6 and 15. From which, both ways too, the perpendicular comes out 8. 


This is stated so casually, as if it were common wisdom, that one can only 
conclude that this way of thinking about negative distances was well-known in his 
time. Nonetheless, as we will see, it doesn't occur in Europe before the work of 
Wallis near the end of the 17th century. 

Figure 4. A triangle with a perpendicular falling outside the base, Bhaskara II 
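
The numbers in verse 166 can be reproduced with a few lines of Python. This is a sketch of the modern formula x = (a² + c² - b²)/2c, not of Bhaskara's own procedure. The function name base_segments is hypothetical, and x is measured from the endpoint of the base where the side of length a starts.

```python
from fractions import Fraction
from math import isqrt


def base_segments(a, b, c):
    """Signed distances from the two ends of the base c to the foot of the altitude."""
    x = Fraction(a * a + c * c - b * b, 2 * c)
    return x, c - x


x, other = base_segments(10, 17, 9)
altitude = isqrt(int(10 ** 2 - x ** 2))      # the same value follows from 17**2 - other**2
area = Fraction(9 * altitude, 2)
print(x, other, altitude, area)              # -6 15 8 36
```

The negative value of x records that the foot of the perpendicular lies 6 units beyond the end of the base, in the direction away from the base, while the altitude and the area come out positive as usual.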


3. The Shunning of Negative Numbers, From Al-Khwarizmi to Galileo 


I now turn to the Arab and Western treatment of negative numbers. To keep the 
story within bounds, I will pick a small selection from the many figures who might 
be discussed, those who seem to me key figures in the story or who exemplify a 
particular stand. 


Al-Khwarizmi (c.790—c.840) 


It is repeated everywhere that the Indians invented zero and place notation and that 
the Arabs learned it from them and later transmitted this to Europe. It's bizarre 
that such a misunderstanding should be widespread but in fact, the Babylonians 
invented place notation (albeit using base 60) and their arithmetic was used by 
many Greeks, e.g. Ptolemy. I hope I have made the case that the most substantial 
arithmetic discovery of the Indians — and independently the Chinese — was not 
merely that of zero but the discovery of negative numbers. Sadly this discovery was 
not absorbed in any but a superficial way by the Arabs. 

Al-Khwarizmi (whose full name was Abu Ja'far Mohammad ibn Musa Al-Khwarizmi) 
was familiar with Indian mathematics and astronomy and apparently 
with Brahmagupta's Brāhma-sphuṭa-siddhānta written some 200 years earlier. He 
worked under the patronage of the caliph Al-Mamun about whom he says "That 
fondness for science, ..., that affability and condescension which he (the caliph) 
shows the learned ... has encouraged me to compose a short work on calculating by 
Completion and Reduction ... such as men constantly require in cases of inheritance, 
legacies, partition, law-suits and trade ..."¹⁷ His book on Algebra is entitled Al-jabr 
w'al muqabala which refers to the operations of completion and reduction with 
which he simplified his equations. These were relations between an unknown, its 
square and constants, given in prose. Nearly half of his book concerns incredibly 
complex inheritance cases. 

¹⁷The Algebra of Mohammed ben Musa, Translated by Frederic Rosen. Facsimile reprint of 1831 
edition by the Oriental Translation Fund, London, Adamant Media Corporation 2002. 

I find three things especially striking in this book. Firstly, negative numbers 
appear only once, in a section on multiplication whose goal appears to be to explain 
the identity 


(a - c)·(b - d) = ab - ad - bc + cd 


and justify it by geometry, just as in our discussion of "Method II" for multiplying 
negative numbers. But then they are never mentioned again. The second striking 
thing is that quadratic equations always have positive coefficients and thus belong 
to three types: 


1. ax² + bx = c, a, b, c > 0 (referred to as "roots and squares are equal to 
numbers") 

2. ax² + c = bx, a, b, c > 0 ("squares and numbers are equal to roots") 

3. ax² = bx + c, a, b, c > 0 ("roots and numbers are equal to squares") 


This separation of cases continues down through the whole European tradition 
through De Morgan. An equation, in short, must be an identity between two positive 
numbers. Thirdly, he discusses exactly the same problem that Bhaskara II was to 
take up: finding altitudes of triangles whose sides are given. But, unlike Bhaskara, 
all the examples he treats have the foot of the perpendicular inside the base so this 
big clue about negatives never comes up. 


Leonardo of Pisa (1170-1250) 


Leonardo of Pisa was one of the first Europeans to master the Arab arts of calculation, 
including the use of Indian symbols and place notation. He wrote a remarkable book, 
his Liber Abaci (Book of Calculation), in which the rules for all the basic arithmetic 
operations are laid out in great detail and exhaustively illustrated by numerical 
examples. This occupies the first half of his book which is essentially what we 
would call a primer. But he deals exclusively with the arithmetic of positive integers 
and positive fractions. His section on subtraction is entitled On the Subtraction of 
Lesser Numbers from Greater Numbers. 

As in the Indian tradition, accounting was one of the principal stimuli for the 
development of arithmetic in the Middle Ages and much of the book deals with 
the arithmetic of money, goods and possessions. The second half of the book treats 
a huge number of “word problems" involving goods and money. He is following 
a curious tradition going back to Diophantus (and found in Chinese and Indian 
works also) of what, to modern eyes, are quite bizarre artificial “word problems" 
involving a group of people who, after exchanging various sums of money, have 
sums satisfying some linear relationships. Here is an example: 


Three men had pounds of sterling, I know not how many, of which one half was the first's, one 
third was the second's and one sixth was the third's; as they wished to have it in a place of 
security, every one of them took from the sterling some amount, and of the amount that the 
first took he put in common one half, and of it that the second took, he put in common a third 
part, and of that which the third took, he put in common a sixth part, and from that which they 
put in common every one received a third part, and thus each had his portion. 


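In modern algebra terms, if S is the sum of sterling and $x_1, x_2, x_3$ are the sums which 
the three men took, so that $(x_1/2 + x_2/3 + x_3/6)$ is what “they put in common”, 
then the last sentence, “each had his portion", sets up three equations: 


$\frac{x_1}{2} + \frac{1}{3}\left(\frac{x_1}{2} + \frac{x_2}{3} + \frac{x_3}{6}\right) = \frac{S}{2}$ 

$\frac{2x_2}{3} + \frac{1}{3}\left(\frac{x_1}{2} + \frac{x_2}{3} + \frac{x_3}{6}\right) = \frac{S}{3}$ 

$\frac{5x_3}{6} + \frac{1}{3}\left(\frac{x_1}{2} + \frac{x_2}{3} + \frac{x_3}{6}\right) = \frac{S}{6}$ 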


This is only one of hundreds of such problems. He develops methods of laying 
out the coefficients in rows and manipulating the numbers to get the answer. In the 
above, the ‘answer’, is the smallest set of relatively prime x’s which solve these three 
homogeneous equations in 3 unknowns. Leonardo has a rather awkward and special 
version of the Chinese algorithm for solving linear equations in many unknowns. 

Now most of his problems are set up so all the numbers which occur are positive. 
But not all! First of all, negative numbers can arise in the course of the calculation. 
He then says things like¹⁹: 


[he is in the middle of an algorithm]. . . and from the 240 you subtract 288 leaving minus 48, 
and this I say because the 288 cannot be subtracted from the 240; from this 48 you take 1/3 
for the 1/3 of the second position; there will be minus 16... . 


He is getting close to the red and black rods of the Chinese, but these examples are 
few and far between and are not pursued very far. In a few other cases, the answer 
itself is negative. For example, after solving the problem described in the first quote, 
he varies the proportions of S owned by the three men to 1/2, 2/5 and 1/10. In this 
case, the solution is $x_1 = 326$, $x_2 = 174$ and $x_3 = -30$. The setting of the problem, 
that all the x’s are amounts of money, comes to his rescue. The third man, he says, 
does not take anything from the sum S which they share but instead puts in an 
additional 30 pounds of his own "proper" money: there were 470 pounds in all, and 
when they "wanted to have it in a place of security", the third man added 30, the 
first man took 326 and the second took 174. When money is concerned, negative 
quantities can always be given a simple meaning! 


¹⁸ Leonardo of Pisa, Liber Abaci, p.415 of the English translation by L. Sigler, Fibonacci's Liber 
Abaci, Springer-Verlag, 2002. 
¹⁹ Ibid., p.419. 
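
The two versions of the three-men problem can be checked mechanically. The following is a minimal sketch in Python and not Leonardo's own method of laying out coefficients in rows. It assumes that S = x1 + x2 + x3 (between them the men take the whole sum), which is consistent with the figures quoted above; the function name `smallest_solution` and its parameters are invented here for illustration. It solves the homogeneous system with exact fractions and returns the smallest integer solution.

```python
from fractions import Fraction as F
from functools import reduce
from math import gcd

def smallest_solution(shares, put_in=(F(1, 2), F(1, 3), F(1, 6))):
    """Smallest integer (x1, x2, x3) for the three-men problem.

    shares[i] : fraction of the total S owned by man i (should sum to 1)
    put_in[i] : fraction of his takings x_i that man i puts in common
    Assumption: S = x1 + x2 + x3, and each man receives a third of the pool.
    """
    # Row i encodes (1 - k_i) x_i + (1/3) sum_j k_j x_j - p_i (x1 + x2 + x3) = 0.
    A = [[((1 - put_in[i]) if i == j else 0) + put_in[j] / 3 - shares[i]
          for j in range(3)] for i in range(3)]
    # When the shares sum to 1 the three rows sum to zero, so two rows suffice.
    # Fix x3 = 1 and solve the remaining 2x2 system by Cramer's rule.
    (a11, a12, a13), (a21, a22, a23) = A[0], A[1]
    det = a11 * a22 - a12 * a21
    b1, b2 = -a13, -a23
    xs = [(b1 * a22 - a12 * b2) / det, (a11 * b2 - a21 * b1) / det, F(1)]
    # Clear denominators, remove common factors, and make the total positive.
    scale = reduce(lambda u, v: u * v // gcd(u, v), (x.denominator for x in xs))
    ints = [int(x * scale) for x in xs]
    g = reduce(gcd, (abs(v) for v in ints))
    ints = [v // g for v in ints]
    if sum(ints) < 0:
        ints = [-v for v in ints]
    return tuple(ints)

print(smallest_solution((F(1, 2), F(1, 3), F(1, 6))))   # (33, 13, 1)
print(smallest_solution((F(1, 2), F(2, 5), F(1, 10))))  # (163, 87, -15)
```

With the original proportions the total is 33 + 13 + 1 = 47. With the varied proportions the primitive solution is (163, 87, −15); doubling it gives the 326, 174 and −30 quoted in the text, with total 470. The negative third entry appears without any special handling, which is exactly the step Leonardo had to justify in words.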

Leonardo is making the first tentative steps towards enlarging the number system 
to include negatives. With money, he is comfortable with assets and debts, giving 
and taking. But his examples are few and he never makes explicit rules for extending 
arithmetic. 


Nicole Oresme (1323-1382) 


Nicole Oresme was a mathematically inspired scholastic, working in Paris in the 
mid-14th century. He made a giant stride taking geometry beyond Euclid. In his 
great book, Tractatus de configurationibus qualitatum et motuum (Treatise on the 
configurations of qualities and motions)²⁰, he proposed considering all intensities 
which varied in time and whose values at different times could be compared by a 
proportion. To any such quality, he proposed constructing a graph. First he took a 
line segment, called the subject, whose points represented the interval of time over 
which the quality was varying. This, in itself, was a radical departure from Euclid: 
now space was being used analogically, as a substitute for time. Then he proposes 
erecting line segments perpendicular to the subject whose lengths had the same 
proportions as the qualities being graphed: 


Therefore, every intensity which can be acquired successively ought to be imagined by a 
straight line perpendicularly erected on some point of the space or subject of the intensible 
thing, e.g. a quality. For whatever ratio is found to exist between intensity and intensity of the 
same kind, a similar ratio is found to exist between line and line, and vice versa.... Therefore, 
the measure of intensities can be fittingly imagined as the measure of lines. (Oresme, I.i) 


He talks about graphing many things (although he never gathers data or actually 
goes beyond making simple cartoons of his graphs — see Figure 5). In particular, 
he discusses graphing velocity, temperature, pain and grace (of a soul). Some of 
these are clearly positive quantities by nature, e.g. pain and grace. He is interested in 
contrasting intensities which are constant (graph (a) in figure), intensities which vary 
at a constant rate (graph (b) in the figure) and intensities which are more complex 
(graphs (c) in figure). For example the grace of a soul ‘occupied by many thoughts 
and affected by many passions’ will be difformly difform — his name for type (c). 
On the other hand, velocities can clearly change sign and, as for temperature, he 
even considers there to be complementary intensities of hotness and coldness. For 
temperature, hotness might have a graph with values $f(x)$ and coldness a graph 
with values $C - f(x)$. In other words, he adds a suitable positive constant so as to 
make every intensity positive everywhere. 


Figure 5. Oresme’s examples of graphs 


²⁰ Translations are from Marshall Clagett, Nicole Oresme and the Medieval Geometry 
of Qualities and Motions, University of Wisconsin, 1968. 

Because his graph is the whole area, not simply the curve at the tips of his line 
segments, he cannot have a graph which goes from positive to negative, crossing 
the ‘subject’. This is especially striking because at one point he makes a catalog 
of various types of difformly difform graphs: but no graph in the catalog is, for 
example, regularly oscillating like a sine wave. He even hints at the fact that the 
area of the graph of velocity is the distance traveled, the fundamental theorem of 
calculus, but to make his picture, the velocity cannot change sign: no backtracking. 
Oresme has gone beyond Euclid in a striking way but he cannot make the further 
leap of allowing negative values for an intensity. 


Luca Pacioli (1445-1517) 


Pacioli’s importance is not due to his discoveries but to the fact that he wrote an 
encyclopedic work Summa de arithmetica, geometria, proportioni et proportionalita 
which summarizes the contemporary knowledge of arithmetic, geometry and 
especially accounting. The work’s greatest influence was due to its description of 
double-entry bookkeeping which was a key step in the expansion of the international 
business enterprises which characterized the Renaissance. Here we find a 
small number of linear equations involving amounts of money whose solution is 
negative. As in Leonardo, when the result is a negative number, it is described as 
a debt. In one case, the price of an egg comes out negative: owning the egg puts 
you in debt so the sellers are paying you to take their eggs. 

Sesiano (op.cit.), however, tracked down one isolated instance of a problem in 
Pacioli's writings which is more exciting. There is an untitled manuscript, written 
for his students in Perugia, which survives in the Vatican²¹. A standard class 
of problems (going back to Babylonian times) involves dividing a number into 
two parts which satisfy some quadratic condition. After solving some such prob- 
lems with positive solutions, he comes to what he calls the bellissimo caso. This 
example asks you to divide 10 into two parts the difference of whose squares is 
200. The reader may like to check that the answer is $10 = 15 + (-5)$. Here 
is a problem not only in pure numbers one of which is negative but requiring 
squaring this negative number. Although an obscure and forgotten footnote to his- 
tory, it seems that the young Pacioli ventured briefly into uncharted territory in 
a truly original way. It is unfortunate that in his Summa, he did not pursue these 
ideas. 
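
The bellissimo caso can be verified in a couple of lines. The sketch below is an illustration only, not Pacioli's procedure: it uses the factorisation x² − y² = (x − y)(x + y), so that once the sum x + y is known the difference x − y follows by a single division. The function name `split_by_square_difference` is hypothetical.

```python
from fractions import Fraction

def split_by_square_difference(total, square_diff):
    """Return (x, y) with x + y = total and x**2 - y**2 = square_diff.

    Uses x**2 - y**2 = (x - y) * (x + y); `total` must be a nonzero integer.
    """
    if total == 0:
        raise ValueError("total must be nonzero")
    diff = Fraction(square_diff, total)  # this is x - y
    x = (total + diff) / 2
    y = (total - diff) / 2
    return x, y

x, y = split_by_square_difference(10, 200)
print(x, y)              # 15 -5
print(x**2 - y**2)       # 200
```

For a difference of squares smaller than 100 both parts of 10 come out positive, for example `split_by_square_difference(10, 40)` gives 7 and 3; the negative part is forced only when the required difference exceeds the square of the total.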


Girolamo Cardano (1501-1576) 


The only reason to include Cardano is that he wrote the book Ars Magna²², so 
we can analyze how he thought, how he looked on negative as well as imaginary 
numbers. The solution of cubic equations was due to Scipione del Ferro, Professor 
of Mathematics at Bologna around 1515, and the solution of the quartic 
to Cardano's student Ludovico Ferrari. Cardano himself was an arrogant man, a 
compulsive gambler, who led a wild life of ups and downs. That he computed 
the odds of various sorts of gambling was arguably his greatest mathematical 
achievement. 

²¹ Cod. Vat. lat. 3129. 

²² Quotations are from the 2007 Dover reprint The Rules of Algebra (Ars Magna), translated by 
T. Richard Witmer. 


If Al-Khwarizmi had spun out the solutions for quadratic equations into many 
different cases, Cardano really went to town describing how to solve 13 distinct 
cases of cubic equations (and 44 types of derivative cases). Why so many? Because 
(a) the coefficients all had to be positive and (b) the equation had to equate a positive 
quantity to another positive quantity. The many sections are entitled things like “On 
the cube and square equal to the first power and number, generally". Nonetheless, 
he did recognize that some of his equations had negative solutions: these he 
called "fictitious (for such we call that which is a debitum or negative)" 
but he does very little with such roots, ignoring them systematically. But in the later 
Chapter, “On the rule for postulating a negative", he does explore a bit what algebra 
can do for you if you admit negative roots. His example of a problem requiring 
negative numbers is this: 


The dowry of Francis’ wife is 100 aurei more than Francis’ own property, and the square (!) 
of the dowry is 400 more than the square of his property. Find the dowry and the property. 


This works out to give Francis —48 aurei of property, that is, he is in debt 48 aurei, but 
fortunately is getting a dowry of 52 aurei. Here he correctly identifies the negative 
solution with a debt. This is an excellent illustration although squaring a sum of 
money is a pretty weird thing to do. 

There would be little else to say except for the curve ball that was thrown to Cardano: 
for all cubic equations which have only one real root, del Ferro’s formula worked 
like a charm. But if there were three real roots (the other possibility, known as the 
casus irreducibilis), it gave an apparently meaningless result. His formula for the 
roots of the equation $x^3 + ax + b = 0$ is: 


$x = \sqrt[3]{-\frac{b}{2} + \sqrt{\frac{-D}{4 \cdot 27}}} + \sqrt[3]{-\frac{b}{2} - \sqrt{\frac{-D}{4 \cdot 27}}}$, where $D = -4a^3 - 27b^2$ 


D, the discriminant, is equal to the product of the squares of the differences of all 
pairs of distinct roots, hence it is positive if all the roots are real and distinct. So we 
need to find the square root of a negative number even though in the end we only want 
the real number x. 
Cardano struggled unsuccessfully with what this might possibly mean. 
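
To see concretely what Cardano was facing, here is a small sketch in Python that evaluates the formula with complex arithmetic. It is an illustration under stated assumptions, not anything Cardano could have written: the function name `cardano_root` is hypothetical, the two sample cubics are chosen here for illustration and are not taken from the text, and the second cube root is obtained from the first through the relation uv = −a/3, which picks out the correct pairing of cube roots.

```python
import cmath

def cardano_root(a, b):
    """One root of x^3 + a*x + b = 0 via the del Ferro/Cardano formula.

    Sketch only: assumes a != 0 so that the first cube root is nonzero.
    """
    if a == 0:
        raise ValueError("this sketch assumes a != 0")
    D = -4 * a**3 - 27 * b**2            # discriminant as defined in the text
    inner = cmath.sqrt(-D / 108)         # sqrt(-D/(4*27)); purely imaginary when D > 0
    u = (-b / 2 + inner) ** (1 / 3)      # principal complex cube root
    v = -a / (3 * u)                     # matching cube root, since u*v = -a/3
    return u + v

# Three real roots (D > 0): the intermediate values are complex,
# yet the imaginary parts cancel in the sum.
print(cardano_root(-15, -4))   # approximately (4+0j)

# One real root (D < 0): everything stays real.
print(cardano_root(6, -20))    # approximately (2+0j)
```

For x³ − 15x − 4 = 0 the discriminant is 13068, the inner square root is 11i, and the two cube roots are 2 + i and 2 − i, whose sum is the real root 4. The real answer is reached only by passing through the square root of a negative number, which is the puzzle described above.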

His one attempt to deal with these complex expressions is in the same Chapter, 
“On the rule for postulating a negative”, mentioned above. Here he considers 
problems which have complex roots, such as the following: 


Divide 10 into two parts the product of which is 40. 


The usual quadratic formula gives the two parts as $5 + \sqrt{-15}$ and $5 - \sqrt{-15}$. This 
is also the answer his math gives him and which he puts in writing in his book but 
he doesn't attribute much meaning to it. He makes his famous comment: 


So progresses arithmetic subtlety, the end of which, as is said, is as refined as it is useless. 


At the end of this Chapter, he gives a third type of example where he reasons 
incorrectly with products of a real and an imaginary. In a later edition, he added 
an appendix De aliza regula liber in which he flirted with the idea that maybe 
$(-1)^2 = +1$ was wrong. Why not try $(-1)^2 = -1$? Between ‘fictitious’ negative 
numbers and useless imaginaries, you get the sense that Cardano was at 
sea. 
Galileo (1564-1642) 


Perhaps mathematicians were stuck thinking that negative numbers were fictitious 
but surely physicists who were actually measuring things in the real world, had a 
clearer view? Arguably, Galileo’s great contribution to physics was his recognition 
that momentum was a key property of objects, that it was constant when no forces 
were acting and that the force of gravity acting on projectiles and falling bodies 
changed their momenta at a constant rate, not their positions. As an old man, when 
the Pope commuted his sentence for heresy to house arrest, he wrote down these 
theories in his Dialog concerning Two New Sciences²³. He starts off with his foil 
Simplicio getting put down again and again by Galileo's mouthpiece Salviati. But 
by the Fourth Day, Galileo lapsed into a more standard Euclid-style exposition 
and puts out the centerpiece of his theory: the demonstration that a projectile 
follows a parabolic arc under the force of gravity. Here was something he had 
actually experimented with and he was on solid ground, theoretically as well as 
experimentally. Figure 6 is an excerpt from his notebooks working on projectiles. 

The central assertion in these dialogs is that gravity endows the projectile with 
a constant downward acceleration. Thus its vertical velocity will be positive going 
up, zero at the peak and negative coming back down. It is a linear function changing 
from positive to negative. The math couldn't be simpler — if you are willing to use 
negative numbers. 
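
As a small illustration of how simple the signed description is, the sketch below writes the vertical motion as a single linear velocity function and a single quadratic height function valid over the whole flight. The launch speed of 20 and acceleration of 10 are arbitrary illustrative values, and the function names are hypothetical; nothing here is taken from Galileo's text.

```python
def vertical_velocity(v0, g, t):
    """Signed vertical velocity at time t: positive rising, negative falling."""
    return v0 - g * t

def height(v0, g, t):
    """Height above the launch point at time t under constant acceleration g."""
    return v0 * t - g * t * t / 2

v0, g = 20, 10   # illustrative values only
for t in (1, 2, 3):
    print(t, vertical_velocity(v0, g, t), height(v0, g, t))
# 1 10 15.0
# 2 0 20.0
# 3 -10 15.0
```

One formula covers ascent, apex and descent: the velocity passes through zero at t = 2 and the heights at t = 1 and t = 3 agree, so the two semi-parabolas are halves of one curve.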

What does Galileo do? His main result is: 


Theorem 1. A projectile which is carried by a uniform horizontal motion compounded 
with a naturally accelerated vertical motion describes a path which is a 
semi-parabola. 


Note that he uses a semi-parabola: the half of the parabola in which height is 
a monotone function of time. Considerably later, after a long discussion of the 
time and distance of the semi-parabolic arc carrying the projectile to the ground, 
he reverses time without any discussion and concludes that the rising phase of a 
projectile is also a semi-parabola. 

The discussion continues on optimal angles at which to fire guns. But the astonishing 
point is that he never talks about the whole parabolic arc, with ascending 
and descending halves and how there is constant downward acceleration throughout 
the path. All the diagrams in the book resemble the figure from his notes: a 
semi-parabola with some auxiliary chords and tangents. He analyzes the geometry 
of the semi-parabola and the physics of a falling body and then asserts without any 
discussion that one can reverse the direction of motion from a fall to a climb — 
nothing else. That the velocity changes at the apex from positive to negative is not 
stated anywhere. 


²³ Quotations are from the 1956 Dover edition, Dialogues Concerning Two New Sciences, translated 
by Henry Crew and Alfonso de Salvio. 


Figure 6. Galileo's notes on projectiles 


Fermat (1601-1665) 


Fermat and Descartes, at essentially the same time, had the idea of introducing 
coordinates into the plane and connecting geometric loci with polynomial equations 
in two variables. Plane curves are not confined to the positive quadrant, so one might 
expect that their logic would have pushed them to allow their variables to take on 
both positive and negative values. But no! Their coordinates were only in a positive 
quadrant and the other parts of a curve were treated separately if at all. 

Below are two figures from Fermat's paper on the subject, Ad Locos Planos 
et Solidos Isagoge (Introduction to Plane and Solid Loci). Incidentally, plane loci 
meant lines and circles, solid loci meant the other conic sections, terminology which 
dates from Greek times. 


Figure 7. Two diagrams from Fermat's Isagoge 


In these figures N is the origin, NM is the x-axis (although Fermat used the letter 
A, not x, because his variables were vowels), ND or NP is the y-axis (the letter E 
for Fermat), x = NZ and y = ZI, so I is the point with coordinates (x, y). On the left, 
he is describing the locus of the equation: 


$d^2 + x \cdot y = r \cdot x + s \cdot y$ (which he writes Dpl. + A in E aeq. R in A + S in E). 


Here s = NO, r = ND and $d^2$ is a constant area, so we have a rectangular 
hyperbola, centered at V, with asymptotes VO and VP. The curious point is that he 
draws only this small part of the hyperbola, cutting it off on the x-axis. He also cuts 
it off at the plotted point I. On the right, he is describing a parabola with equation: 


$x^2 = d \cdot y$ (which he writes Aq. aequatur D in E) 


Again, he cuts the locus off at his axes (and at I). 

Descartes’ treatment is similar, except that he does say in the text that there are 
multiple orderings possible for the relevant points on the axes and that you must set 
up different equations depending on the directions and ordering of both the variable 
point and the constants in the construction. The goal is to make both sides of your 
equation sums of positive quantities, just as in Al-Khwarizmi and Cardano's work 
on quadratic and cubic equations. Note that this is how Fermat's version of the 
equation for the hyperbola reads. 


4. Clarifying the Muddle: Wallis and Newton 


So when did European mathematicians begin to make their peace with negative 
numbers? The first treatment which seems to me quite modern is that of John Wallis 
(1616-1703), Professor of Mathematics at Oxford. He published his Treatise on 
Algebra²⁴, written in English, in 1685. This was just two years before Newton 
published his earth-shaking Principia Mathematica and well after Newton had 
done his major work in mathematics. In his mathematical notes, where he used 
algebra and coordinates, Newton was equally modern in his treatment of negative 
numbers, putting them on equal footing with positive numbers. So we should 
attribute the first clear European view of negative numbers to Wallis and Newton 
equally. 


²⁴ Available online at http://eebo.chadwyck.com through subscribing universities. 

In Chapter 16, Addition, Subduction, Multiplication and Extraction of Roots in 
Specious Arithmetic, Wallis defines negative numbers as nicely, simply and clearly 
as you could wish (here “Specious” is Viète's term for arithmetic with variables 
given by letters): 


To these Notes, Symbols or Species are prefixed (as occasion requires) not only numeral 
figures, but the signs + and — (or plus and minus), the former of which is a Note of Position, 
Affirmation or Addition; the other of Defect, Negation or Subduction: According as such 
Magnitude is supposed to be, or to be wanting. And where no such Sign is, it is presumed to 
be Affirmative and the sign + is understood. 


And accordingly these Signs are still to be interpreted as in a contrary signification, If 
+ signify Upward, Forward, Gain, Increase, Above, Before, Addition, etc. then — is to be 
interpreted of Downward, Backward, Loss, Decrease, Below, Behind, Subduction, etc. And if 
+ be understood of these, then — is to be interpreted of the contrary. 


In this quote, the capitalization is his. With this understanding of negatives, how 
does he justify the rule for multiplying negatives? Here is what he says: 


For the true notion of Multiplication is this, to put the Multiplicand, or thing Multiplied 
(whatever it be) so often as are the Units in the Multiplier. ... and this, whatever the 
thing Multiplied, Positive or Negative: for there may well be a Double Deficit as a Double 
Magnitude; and −2A is as much the Double of −A as +2A is the Double of A. ... 


But in case the Multiplier be a Deficit or Negative quantity; suppose −1; then instead of 
Putting the Multiplicand so many times, it will signify so many times to Take away the 
Multiplicand. ... so that + by − makes −; But to Multiply −A by −2 is twice to take away a 
Defect or Negative. Now to take away a Defect is the same as to supply it; and twice to take 
away the Defect of A is the same as twice to add A or to put 2A ... : So that − by − (as well as 
+ by +) makes +. 


As far as I know, this is the first place in Western literature in which the rule of signs 
is not merely stated but explained so clearly. After this, when he gets to writing 
out the formulae for roots of equations, he no longer has to separate all these cases 
which we saw in Al-Khwarizmi and Cardano. For the quadratic he writes: 


Given the equation $x^2 + 2bx = \pm c^2$, 


the roots are $x + b = \sqrt{\pm c^2 + b^2}$ 


(I have only changed his variable from a to x and noted squares by using e.g. $c^2$ for 
his cc.) Note that he follows Euclid in making all terms homogeneous, so that, for 
example, x, b, c can all be lengths and the equation relates an area to an area. For 
this reason, he needs the symbol ± in front of the $c^2$. 
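
The practical gain from admitting signs can be sketched in a few lines of Python. The function below is a hypothetical illustration, not Wallis's notation: it completes the square for x² + 2bx = k, where b and the right-hand side k (standing for plus or minus c²) may each be positive or negative, and returns both roots. The three calls correspond to the three separate types that Al-Khwarizmi had to treat one by one; the numerical coefficients are made up for the example.

```python
import math

def signed_quadratic_roots(b, k):
    """Both roots of x^2 + 2*b*x = k, found by completing the square.

    (x + b)^2 = b^2 + k, so x = -b +/- sqrt(b^2 + k).
    Raises ValueError when b^2 + k < 0 (no real roots).
    """
    radicand = b * b + k
    if radicand < 0:
        raise ValueError("no real roots")
    r = math.sqrt(radicand)
    return (-b + r, -b - r)

print(signed_quadratic_roots(3, 16))    # x^2 + 6x = 16   -> (2.0, -8.0)
print(signed_quadratic_roots(-5, -16))  # x^2 + 16 = 10x  -> (8.0, 2.0)
print(signed_quadratic_roots(-3, 16))   # x^2 = 6x + 16   -> (8.0, -2.0)
```

A single rule replaces the case analysis, and the negative roots that the older tradition discarded appear alongside the positive ones.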

Finally, Wallis gives what I believe is the first explicit use of the full number line, 
positives to the right, negatives to the left, in Western literature: 


Yet is it not that Supposition (of Negative Quantities) either Unuseful or Absurd when rightly 
understood. And though, as to the bare Algebraick Notation, it import a Quantity less than 
nothing: Yet, when it comes to a Physical Application, it denotes as Real a Quantity as if the 
Sign were +; but to be interpreted in a contrary sense. 


As for instance: Supposing a man to have advanced or moved forward (from A to B) 5 yards; 
and then to retreat (from B to C) 2 yards; If it be asked, how much had he Advanced (upon 
the whole march) when at C? I find ... he has Advanced 3 Yards. But if, having Advanced 5 
Yards to B, he thence retreat 8 Yards to D; and it then be asked, How much is he Advanced 
when at D, or how much Forwarder than when he was at A: I say −3 Yards. ... That is to say, 
he is advanced 3 Yards less than nothing. ... (Which) is but what we should say (in ordinary 
form of Speech), he is Retreated 3 Yards; or he wants 3 Yards of being so Forward as he was 
at A. 




Figure 8. Wallis’s illustration of the “number line” 


Newton, as one would expect, had a full command of negative numbers and all 
their uses. He wrote lecture notes on arithmetic, algebra and geometry at some point, 
presumably early in his career. They were first published (without his approval) in 
1707 and later translated into English with the title Universal Arithmetick. Here he 
introduces negative numbers at the very beginning with the following sentences²⁵: 


Quantities are either Affirmative, or greater than nothing; or Negative, or less than nothing. 
Thus in human affairs, possessions or stock may be called affirmative goods, and debts 
negative ones. And so in local motion, progression may be called affirmative motion, and 
regression negative motion; because the first augments, and the other diminishes the length 
of the way made. And after the same manner in geometry, if a line drawn in a certain way be 
reckoned for affirmative, then a line drawn the contrary way may be taken for negative. 


Later on, he discusses multiplication and is very clear that pure numbers arise as 
ratios of quantities with the same dimension and one can either multiply a quantity 
with a dimension by a pure number, getting another such quantity, or multiply two 
pure numbers. He states the rule for the sign of the product simply as “... making the 
product Affirmative if both factors are Affirmative or both Negative; and Negative if 
otherwise.” Unfortunately, he says nothing about why one should believe in this rule. 


²⁵ Page 3 of the second edition published in 1728. 

Whereas Fermat had given a systematic study of quadratic equations in two 
variables showing that they all defined conic sections and Descartes had introduced 
several cubic equations giving new curves (notably the “Cartesian parabola” and 
his "Folium"), Newton went on to look at all possible cubics, in an article entitled 
“Curves” in Lexicon Technicum by John Harris published in London in 1710. He 
classified them into 72 types and sketched them. Without hesitation, he used all four 
quadrants of the plane and plotted all roots (x, y), positive and negative. Here is an 
example: 


Figure 9. One of the 72 types of cubic curves plotted by Newton 


After Wallis and Newton’s work, a modern arithmetic with negative numbers 
was widely accepted in Continental Europe, where there was an explosion of math- 
ematical research during the Enlightenment. In England, curiously, the resistance 
to negative numbers continued for some 150 years, culminating in De Morgan. A 
long debate ensued between those who accepted them and those who didn’t, a story 
which is beautifully described in Pycior’s book that we have cited. In the end, De 
Morgan and Hamilton founded the general theory of fields and negative reals took 
their place in the greater world of complex numbers and quaternions. 


5. Two Factors in the World View of 15th-17th Century Europe 


I hope I have proven my point that Europe in the 16th and 17th centuries resisted 
expanding its numbers to include negatives in a way which calls for some explanation. 
China and India both seem to have moved naturally to this bigger domain of 
numbers when the occasion presented itself. I want to make the case that the European 
reluctance was due to two factors. The first was the overwhelming importance 
of Euclid in defining what is and what is not mathematics and the fact that negative 
numbers had no place in Euclid’s view of mathematics. The second is that, at the 
time negative numbers should have been accepted, imaginary numbers cropped up 
too and the idea arose that both negative and imaginary numbers had the same 
twilight existence. It was because of negatives that square roots had a problem, so 
maybe it was best to consider them both as second class citizens of the world of 
numbers. 

Euclid’s Elements were written in the newly founded school/library at Alexandria 
around 300 BCE and integrated the mathematical ideas of Theaetetus, Eudoxus and 
many others in a systematic treatise. It is written in a monolithic theorem/proof 
style not seen again in the History of Mathematics until the collective ‘Bourbaki’ 
composed their treatise in the 20th century. It was translated into Arabic in the 8th 
century CE and from Arabic into Latin in the 12th century. As a result, it came to define 
what mathematics is for every generation of Arabs and Europeans, arguably until 
Newton and the Enlightenment when concepts with no roots in the Elements began 
to take center stage. 

But what is Euclidean mathematics? There are roughly three parts to the Ele- 
ments: Books I-VI on plane figures, Books VII-X on number theory and irrationals 
and Books XI-XIII on three dimensional geometry. What numbers occur in the 
Elements? Here's a list: 


1. "magnitudes": the length of a line, the area of a plane figure and the volume 
of a solid figure 


2. positive integers, implicitly as in “The greater is a multiple of the less when 
it is measured by the less” (definition 2, Book V) and explicitly as in “A 
number is a multitude composed of units” (definition 2, Book VII). Note that 
the number is still a length but, because he always has a “unit” around when 
studying numbers, it becomes in effect dimensionless. 


3. ratios, as in “A ratio is a sort of relation in respect of size between two 
magnitudes of the same kind" (definition 3, Book V). 


Note that none of these concepts give numbers which can be negative or even zero. 
What sort of arithmetic does Euclid have for these numbers? 


1. Magnitudes are clearly added and subtracted (so long as the result remains 
positive), lengths are multiplied to give areas and volumes, etc. But “units” 
are only introduced in Book VII and there are no actual calculations and 
certainly no approximations (e.g. for $\pi$). 


2. Positive integers are also added and subtracted and multiplication is defined 
in “A number is said to multiply a number when that which is multiplied is 
added to itself as many times as there are units in the other and thus some 
number is produced". 


3. Adding and multiplying ratios is the main goal in the extremely abstract Book 
V, which is said to be the work of Eudoxus. Book V begins with defining 
when two ratios are equal. For any ratio given by two lines A and B, he 
considers which multiples satisfy nA > mB and which satisfy nA < mB. 
Of course, this is the ‘cut’ Dedekind re-introduced in the 19th century to 
construct real numbers from rationals. Here Eudoxus doesn’t need to define 
real numbers: they are ratios given by geometry. What he needs to do is to 
define equality of ratios and he does this by requiring that their associated 
cuts are the same. Addition and multiplication of ratios are both implicit in 
that (a) if a line segment A is divided into two parts B and C then A : D 
is going to be the sum of B : D and C : D and (b) A : C is going to be 
the product of A : B and B : C. What is not at all clear is that addition 
and multiplication are well-defined operations on the equivalence classes 
called ratios. This is exactly what is asserted in Proposition 24, Book V (for 
addition) and in Proposition 22, Book V (for multiplication) after a long 
and subtle sequence of intermediate steps. One stands amazed at Eudoxus’ 
mathematical skills. 
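
The definition of equal ratios described in item 3 can be sketched computationally. The following Python function is only an illustration under a strong assumption: the definition quantifies over all pairs of multiples (n, m), whereas the code can test only finitely many, up to a hypothetical parameter `bound`. A result of False therefore refutes equality, while True means only that no disagreement was found within the bound. The function name `same_ratio` is invented for the example.

```python
import math

def same_ratio(a, b, c, d, bound=50):
    """Test whether a:b and c:d define the same 'cut' for multiples up to `bound`.

    The ratios agree on (n, m) when n*a compared with m*b gives the same
    result (greater, equal, less) as n*c compared with m*d.
    """
    def sign(v):
        return (v > 0) - (v < 0)
    return all(sign(n * a - m * b) == sign(n * c - m * d)
               for n in range(1, bound + 1)
               for m in range(1, bound + 1))

print(same_ratio(2, 3, 4, 6))                        # True
print(same_ratio(2, 3, 5, 7))                        # False (n = 3, m = 2 disagree)
print(same_ratio(math.sqrt(2), 1, 2, math.sqrt(2)))  # True
```

The last call compares the ratio of the diagonal of a square to its side with the ratio of twice the side to the diagonal. Neither ratio is a ratio of integers, yet the comparison of multiples still applies, which is the point of the definition: it needs only positive magnitudes, positive integers and the relations greater and less.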


How about algebra, identities and formulas with the arithmetic operations? Euclid 
studies at length in Book II what people call ‘geometric algebra’, a series of propositions 
which amount to algebraic identities such as 


$(a + x)^2 + (a - x)^2 = 2a^2 + 2x^2$ 


which is essentially the content of Proposition 9, Book II. Now what about the 
solutions of quadratic equations? This seems to be essentially what the lengthy and 
confusing Book X is all about. As Heath points out in his introduction to Book X, 
Euclid's classification of binomials and apotomes can be read as a systematic study 
of all the positive roots of all possible quadratic equations. This sets the stage for 
the separation of cases in treating roots of polynomial equations in all the works we 
have reviewed. 

All in all, if you are going to start with Euclid, you are not going to be predisposed 
to introduce negative numbers into your calculations. He has gone to extraordinary 
lengths to reduce arithmetic and algebra to geometry and thoroughly inoculate it 
against negatives. It is worth looking briefly at what else was known at 300 BCE 
which Euclid did not put in his book. There is apparently an unbroken tradition 
starting in Babylon in 1800 BCE and continuing through Ptolemy of calculating 
with the sexagesimal equivalent of decimals and approximating e.g. $\sqrt{2}$ and $\pi$ to 
many sexagesimal places. Moreover, there was a tradition, also going this far 
back, of solving quadratic equations by algorithms, described in words but exactly 
equivalent to the quadratic formula. Euclid, in other words, distanced himself from 
a rich numerical tradition and consciously, it would seem, purified his version of 
mathematics. 

The Europeans, then, had the benefit of this shining example of pure math and 
of the wonderful deductive logic on which they built. But it was hard to go beyond 
it in any radical way, to model other phenomena in the real world which cried 
out for negatives. Euclid was both the strength and the weakness of the European 
mathematical world of the 16th and 17th centuries. 

But I think there is a second factor behind the slow acceptance of negatives which 
ought to be considered. As soon as one accepted $-1$, the algebra of the day thrust 
upon one formulae requiring its square root and this was truly inexplicable. The 
fates of $-1$ and $i$ were inseparable. Cardano’s book makes this very clear. We have 
already quoted from Chapter 37, near the end of his book, entitled On the Rule for 
Postulating a Negative. The Chapter starts with the sentence: 


This rule is threefold, for one either assumes a negative, or seeks a negative square root, or 
seeks what is not. 


He is essentially equating three follies, all problematical. That he later entertained 
the idea that perhaps $(-1)^2$ ought to be equal to $-1$ shows how he viewed the 
problems as intertwined. Harriot (1560-1621) also played with both possibilities, 
as in the poem: 


Yet lesse of lesse makes lesse or more, 
Use which is best keep both in store 


(Here ‘lesse of lesse’ means multiplying $-1$ by $-1$ and he asks in line 1 whether 
this should equal $-1$ or $+1$). 

Even if you didn’t accept $-1$, the casus irreducibilis mentioned above, the case 
of cubic equations with three real roots, was a bone in the throat of algebraists. As 
long as one of these roots was positive, you really ought to have a formula for this root. 
But the formula of del Ferro for solving cubics requires in this case that you take the 
square root of a negative number in an intermediate step. Of course, the imaginary 
parts of the resulting complex expressions will cancel at the end but not before. The 
full story of this problem is quite ironic. Viète in 1593²⁶ discovered that trisecting 
an angle was equivalent to solving the special cubic equation which belongs to the 
casus irreducibilis: 


$x^3 - 3x = b$ 


²⁶In his Supplementum Geometriae. A full treatment is in Theoremata ad sectiones angulares [Opera 
pp 287-304] 

and he showed how to reduce the general casus irreducibilis to this special case. 
Thus he reduced a famous unsolved algebraic problem to a famous geometric one, 
unsolved in the sense that no ruler and compass construction was known (nor exists). 
At the same time, Bombelli proposed that Cardano’s formula could make sense if 
you solved 


$(x + \sqrt{-y})^3 = a + \sqrt{-b}$ 


So trisecting an angle was related to taking complex cube roots, but no one put these 
together for a long time by finding the geometric meaning of complex numbers. Later 
we have Wallis, knowing the geometric meaning of negatives as the left half line, 
searching for a two dimensional geometric interpretation of imaginary numbers. 
There was a big clue on the table if anyone had linked Viète’s trisection with taking 
cube roots of complex numbers. I believe it was Euler who finally worked out 
complex exponentials and made the link between these two. Oddly enough, even 
then Euler did not make explicit the geometric interpretation of complex numbers, 
leaving this to Wessel, Gauss and Argand. 
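
In present-day notation the link between the trisection cubic and complex cube roots can be written out in a few lines. What follows is a modern gloss using the triple-angle formula and the substitution $x = z + z^{-1}$; it is not a reconstruction of Viète's or Bombelli's own arguments, and the restriction $|b| \le 2$ (the case of three real roots) is an assumption made to keep $\theta$ real.

```latex
\begin{align*}
x = 2\cos\theta \;&\Longrightarrow\;
   x^3 - 3x = 8\cos^3\theta - 6\cos\theta = 2\cos 3\theta, \\
\text{so, for } |b| \le 2,\quad x^3 - 3x = b \;&\Longleftrightarrow\; \cos 3\theta = \tfrac{b}{2}. \\[4pt]
z = e^{i\theta},\; x = z + z^{-1} \;&\Longrightarrow\;
   x^3 - 3x = z^3 + z^{-3}, \\
\text{so}\quad z^3 &= \tfrac{b}{2} \pm \sqrt{\tfrac{b^2}{4} - 1}
   = \tfrac{b}{2} \pm i\sqrt{1 - \tfrac{b^2}{4}} .
\end{align*}
```

Read this way, finding the angle $\theta$ from $3\theta$ (trisection) and extracting the cube root $z$ of a complex number of modulus 1 are the same computation, and the imaginary parts cancel in $x = z + z^{-1}$.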

Finally, there is also the issue of a psychological explanation for avoiding negative 
numbers. As Tversky and Kahneman have made popular, people are ‘loss averse’: 
a loss of $x causes more pain than a gain of $x and they do not act rationally using 
mathematically correct expectations. The fear of loss is one of the themes in Ionesco’s 
bizarre play The Lesson, where a young woman comes for a tutoring lesson: she 
can add with proficiency but cannot subtract. The mathematician doesn’t come off 
very rational either: he winds up killing her. 

Mathematicians are attracted to Platonism, to believing that their discoveries 
are all insights into the eternal true world of mathematical facts. This example, 
the discovery of negative arithmetic and its incorporation into our numerical and 
algebraic toolkit, shows us that we must not be too literal. Yes, negative numbers 
were eventually accepted in the West as well as in China and India and all three 
cultures made the same math out of them. But there can be huge differences between 
cultures in the way mathematics unrolls. Euclid led the West down a certain path, 
dominated for many centuries by geometric figures and constructions. Other cultures 
were more practical and looked to solving concrete problems with approximate 
numbers. I think the discovery of calculus is another instance of this split: in India, 
studying the numerical table of sines led mathematicians to the idea of first and 
second differences and the fundamental theorem of calculus. But that is another 
story. 


Kuttaka, Bhavana and Cakravala 


Amartya Kumar Dutta 


Ancient Indian mathematical treatises contain ingenious methods for finding integer solutions of 
indeterminate (or Diophantine) equations. The three greatest landmarks in this area are the kuttaka 
method of Aryabhata for solving the linear indeterminate equation $ay - bx = c$, the bhavana 
law of Brahmagupta, and the cakravala algorithm described by Jayadeva and Bhaskara II for 
solving the quadratic indeterminate equation $Dx^2 + 1 = y^2$. We shall briefly recall the history 
of the above equations in ancient India and of their rediscovery in Europe, give an account of 
the ancient Indian texts dealing with algebra in general and the above indeterminate equations in 
particular, and mention a few works on history of mathematics which have highlighted ancient 
Indian algebra, especially indeterminate equations. We shall then discuss various mathematical 
aspects of the kuttaka, bhavana and cakravala from the viewpoint of modern algebra and number 
theory and the general cultural atmosphere in which the leading Indian mathematicians undertook 
such explorations. We shall also examine the coverage of these results in texts involving history 
of mathematics. 


Introduction 


In a plenary talk at the International Congress of Mathematicians (1978), André 
Weil¹ asserted ([We1, p 231-232]): 


An understanding in depth of the mathematics of any given period is hardly ever to 
be achieved without knowledge extending far beyond its ostensible subject-matter. 
More often than not, what makes it interesting is precisely the early occurrence 
of concepts and methods destined to emerge only later into the conscious mind of 
mathematicians; the historian's task is to disengage them and trace their influence 
or lack of influence on subsequent developments. 


Indeed, one of the tendencies in recent research on history of mathematics has 
been to discover in ancient works the equivalents of modern concepts and ideas 
lying concealed in archaic language and style. For instance, I.G. Bashmakova's 
innovative studies² on the work of Diophantus, with this approach, have contributed 


¹André Weil (1906-98), one of the giants of 20th century mathematics, noted for his outstanding 
contributions in number theory and algebraic geometry, had a lifelong interest in the history of 
mathematics. 

²A retrospective of the life and work of Isabella G. Bashmakova was published in Historia Mathematica 
8 (1981), p 389-392, on the occasion of her 60th birthday; a more detailed and updated account, 

to the revival of interest in Diophantus and enhanced the understanding of his work. 
However there is a dearth of literature which makes an analogous sophisticated 
analysis of ancient Indian works on indeterminate equations³ (and other topics) in 
the light of modern mathematics. 

Although the basic facts regarding ancient Indian works on the indeterminate 
equations $ay - bx = c$ and $Dx^2 + 1 = y^2$ are well-known among historians of 
Indian mathematics, not many mathematicians, even in India, are aware of these 
works. Consequently, there is a general lack of awareness about the heights attained 
in ancient Indian mathematics among scholars in other relevant disciplines (science, 
history, Indology, etc) as well as among general readers. Further, in several accounts 
on history of mathematics, one often finds misleading remarks about these works. 
Some of the accounts also reveal inaccurate notions on general Indian history. 

In view of the above observations, the present paper has been prepared with the 
following aims: 


(i) to present a mathematically non-technical history of the equations which 
will be accessible to readers who are not necessarily familiar with advanced 
mathematics; 


(ii) to initiate a study on the significance of the kuttaka, bhavana and the 
cakravala in the light of modern algebra and number theory and to explore 
their pedagogic potential; 


(iii) to recall relevant features of the cultural history of India during the “Classical 
Age”; 


(iv) to discuss some issues regarding the presentation of Indian works on inde- 
terminate equations in texts on history of mathematics. 


Roughly, sections 1-4 correspond to (i); sections 5-7 to (ii), and section 8 to (iii). 
Section 9 is specific to (iv); but the preceding sections, especially 3, 8 and parts of 
5-7 are all relevant to (iv). 

Section 1 gives a brief history of the methods for solving the equations 
$ay - bx = c$ and $Dx^2 + 1 = y^2$ in ancient India and in Europe. Section 2 
gives an account of ancient Indian texts which deal with the equations while section 


with bibliography of her writings, was published in Historia Mathematica 29 (2002), p 370-382, on her 
80th birth anniversary. 

³An algebraic (i.e., polynomial) equation in more than one variable (or more generally, a system 
of m algebraic equations in more than m variables) is called indeterminate. The term is suggestive of 
the fact that such a system may have infinitely many solutions. Diophantus of Alexandria (c. 250 CE) 
investigated rational solutions (solutions in rational numbers) of indeterminate equations with integer 
coefficients. In his honour, the indeterminate equations with integer coefficients whose solutions are to 
be found in integers (or sometimes in rational numbers) are called Diophantine equations; the adjective 
“Diophantine” pertains not so much to the nature of the equation as to the nature of the admissible 
solutions of the equation. 

3 gives a partial account of the representation of Indian works on indeterminate 
analysis in texts on history of mathematics. Section 4 presents the history of the 
development of certain topics of basic mathematics relevant to the kuttaka. Sections 
5-7 give a mathematical discussion of the kuttaka, bhavana and cakravala 
respectively. Section 8 highlights some features of the intellectual life of post-Vedic 
ancient India which are reflected in the legacy of Brahmagupta and other algebraists. 
Section 9 gives a critical discussion of certain views regarding the motivation of 
Brahmagupta and others in investigating the equation $Dx^2 + 1 = y^2$. 

Readers interested in mathematical discussions may go directly to sections 5-7. 
Readers preferring non-technical discussions can first look at sections 1-3, 8, 9 and parts 
of 4. Parts of sections 5-7 (especially section 6) will also be accessible to general 
readers; they can go through these sections by skipping a few sentences involving 
mathematical technicalities. 


1. A Brief History of The Problems 
Indeterminate Analysis in Post-Vedic Ancient India 


The mathematical treatises of the Classical Age in ancient India exude the gen- 
eral intellectual robustness of the era. Two outstanding works of the period are: 
Aryabhatiya of Aryabhata (499 CE) and Brahma Sphuta Siddhanta of Brahmagupta 
(628 CE). These texts had a decisive influence on the course taken by Indian mathe- 
matics (and astronomy) for the next 1000 years. In particular, they were responsible 
for the emergence of a flourishing algebra culture in Indian mathematics. 

Apart from developing fundamental concepts and results of symbolic algebra⁴, 
the algebraists of the Classical Age revelled in the challenging problems of finding 
integer solutions of indeterminate equations. The first explicit description of the 
general integral solution of the linear Diophantine equation $ay - bx = c$ occurs 
in the Aryabhatiya (499 CE) of Aryabhata. The algorithm is described in just two 
cryptic stanzas (verses 32 and 33) at the end of the chapter Ganita (mathematics). The 
solution was subsequently discussed, with variations and refinements, by several 
Indian mathematicians including Bhaskara I (c. 600 CE)⁵, Brahmagupta (628 CE), 
Mahavira (850 CE), Govindasvamin (c. 860 CE), Prthudakasvamin (c. 860 CE), 


⁴Algebra, as commonly understood, begins with the use of symbols (letters of the alphabet to denote 
unknowns) and equations ([DS2, Preface]). This phase of the subject is sometimes called symbolic 
algebra — the adjective "symbolic" is used to distinguish it from ancient geometric and arithmetical 
works where algebraic knowledge and ideas are only implicit. 

⁵Different dates have been given for Bhaskara I like 522 CE (B. Datta), 574 CE (Sarasvati Amma) 
and 629 CE (K.S. Shukla). Brahmagupta's ninth-century commentator Prthudakasvamin places 
Brahmagupta (628 CE) later than Bhaskara I. We adopt 600 CE as a possible approximate date. See 
[Sar, p 9]. 

Aryabhata II (950 CE), Sripati (1039 CE), Bhaskara II (1150 CE) and Narayana 
(c. 1350 CE). The algorithm was termed kuttaka (pulverizer), derived from the root 
kutt (to crush, to grind); the equation itself (or the problem of finding integer solutions 
of the equation) was also referred to as kuttaka. The kuttaka principle was considered 
so important that the subject algebra itself was initially called kuttaka-ganita, or 
simply, kuttaka. Bhaskara I gives 30 examples in his commentary Aryabhatiya-bhasya 
to illustrate the method for solving linear indeterminate equations (see 
[Shu4, p 309-334] and [Ke, p 130-166] for the examples); examples are also given 
in his astronomy treatises Maha-Bhaskariya (see [Shu2, p 29-46]) and Laghu-Bhaskariya 
(see [Shu3, p 99-114]). The Mahasiddhanta of Aryabhata II contains 
a separate chapter (18) called kuttaka exclusively on the solution to the linear 
Diophantine equation. Bhaskara II discusses the kuttaka in two of his treatises, 
Lilavati ([PNS, chapter 33, p 167-175]) and Bijaganita ([Ab, chapter 5, p 17-21]; 
[P, chapter 5, p 49-69]). Devaraja wrote a separate treatise Kuttakara-siromani 
exclusively on the topic ([DS2, p 88]). 
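
In modern terms, finding the general integer solution of $ay - bx = c$ amounts to running the Euclidean algorithm on a and b and substituting back. The sketch below is a present-day illustration only, not a rendering of Aryabhata's verses: it assumes a and b are positive integers, uses the extended Euclidean algorithm, and the names `extended_gcd` and `solve_linear` are hypothetical.

```python
def extended_gcd(a, b):
    """Return (g, s, t) with g = gcd(a, b) and s*a + t*b == g."""
    old_r, r = a, b
    old_s, s = 1, 0
    old_t, t = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_s, s = s, old_s - q * s
        old_t, t = t, old_t - q * t
    return old_r, old_s, old_t


def solve_linear(a, b, c):
    """Solve a*y - b*x == c in integers, assuming a and b are positive.

    Returns (x0, y0, dx, dy) such that x = x0 + n*dx, y = y0 + n*dy is a
    solution for every integer n, or None when gcd(a, b) does not divide c.
    """
    g, s, t = extended_gcd(a, b)
    if c % g != 0:
        return None
    scale = c // g
    # s*a + t*b == g  implies  a*(s*scale) - b*(-t*scale) == c
    y0 = s * scale
    x0 = -t * scale
    dx, dy = a // g, b // g
    # Shift along the solution family so that 0 <= y0 < dy.
    n = y0 // dy
    return x0 - n * dx, y0 - n * dy, dx, dy


print(solve_linear(7, 5, 1))   # (4, 3, 7, 5): 7*3 - 5*4 == 1
```

The returned pair `(dx, dy)` makes explicit why one particular solution yields infinitely many: adding `a/g` to x and `b/g` to y leaves `a*y - b*x` unchanged.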

Having successfully dealt with the linear Diophantine equation, Indian algebraists 
took up the harder problem of investigating the quadratic indeterminate 
equation $Dx^2 + m = y^2$, called varga-prakrti (square-natured) in ancient India. 
They laid special emphasis on solving the important case $Dx^2 + 1 = y^2$, where $D$ is 
a positive integer which is not a perfect square.⁶ The first major breakthrough in this 
problem was achieved by Brahmagupta in 628 CE through a brilliant innovation: 
a law of composition called bhavana. This composition principle of Brahmagupta 
is of paramount significance in modern algebra and number theory. 

Brahmagupta applied his composition rule to generate an infinite number of 
integer solutions from a given integer solution of $Dx^2 + 1 = y^2$ and also showed 
how to arrive at a given integer solution in a wide variety of cases (that is, for various 
values of $D$ like $D = 83$ or $D = 92$). While he had thus given a partial solution to the 
problem, another Indian algebraist subsequently discovered the complete integer 
solution by a cyclic method called cakravala (cakra: wheel or disc) in ancient India. 
The discovery took place sometime during the 7th-11th centuries; for the cakravala 
algorithm has been described by Jayadeva (who lived prior to 1073 CE)⁷ and 
the famous astronomer-mathematician Bhaskara II (1150 CE). However, the true 
originator is not known (see next section). Bhaskara II illustrated the cakravala with 


⁶If $D$ is negative, or if $D$ is a square of a positive integer, then $Dx^2 + 1 = y^2$ has only finitely many 
integer solutions; in fact, it is easy to see that $(0, \pm 1)$ are the only solutions in all these cases except 
$D = -1$, and $(\pm 1, 0)$, $(0, \pm 1)$ are the only solutions when $D = -1$. It is when $\sqrt{D}$ is an irrational real 
number that the problem becomes mathematically interesting, and it turns out that the equation will have 
infinitely many solutions. 

⁷Jayadeva's verses on the solution of the indeterminate equation $Dx^2 + 1 = y^2$ have been quoted in the 
text Sundari of Udayadivakara composed in 1073 CE. Nothing is known so far about this mathematician 
Jayadeva. He is not to be confused with the Vaisnava poet Jayadeva of the 12th century who composed 
Gita-Govinda. 

difficult numerical cases like $D = 61$ and $D = 67$. For the equation $61x^2 + 1 = y^2$, 
the smallest solution in positive integers is $x = 226153980$, $y = 1766319049$, 
indicating the unexpected intricacy of the problem.⁸ Narayana (c. 1350 CE) too 
discussed solutions of the equation $Dx^2 + 1 = y^2$, illustrated the method with 
the cases $D = 97$ and $D = 103$, and showed how to use the solutions to give 
rational approximations to $\sqrt{D}$ (see section 7). Besides the equations $ay - bx = c$ 
and $Dx^2 + m = y^2$, several other indeterminate and simultaneous indeterminate 
equations also occur in the treatises of some of the above mathematicians, especially 
Bhaskara II. 
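
The way one solution generates others can be made concrete with a few lines of code. The sketch below uses the usual modern statement of Brahmagupta's composition (often called Brahmagupta's identity): if $Dx_1^2 + m_1 = y_1^2$ and $Dx_2^2 + m_2 = y_2^2$, then $(x_1 y_2 + x_2 y_1,\; Dx_1x_2 + y_1y_2)$ satisfies the equation with $m = m_1 m_2$. The function names and the triple layout `(x, y, m)` are hypothetical, and the auxiliary pair (3805, 29718) for $61x^2 - 1 = y^2$ is a standard value that does not appear in the text above.

```python
def bhavana(D, s, t):
    """Compose s = (x1, y1, m1) and t = (x2, y2, m2), where D*x**2 + m == y**2."""
    x1, y1, m1 = s
    x2, y2, m2 = t
    return (x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2, m1 * m2)


def is_solution(D, triple):
    """Check that (x, y, m) satisfies D*x**2 + m == y**2."""
    x, y, m = triple
    return D * x * x + m == y * y


# D = 2: composing the solution (2, 3) with itself again and again
# produces an unending list of solutions of 2x^2 + 1 = y^2.
base = (2, 3, 1)
current = base
for _ in range(3):
    current = bhavana(2, current, base)
    assert is_solution(2, current)

# D = 61: composing a solution of 61x^2 - 1 = y^2 with itself gives a
# solution of 61x^2 + 1 = y^2, since (-1) * (-1) = 1.
d61 = bhavana(61, (3805, 29718, -1), (3805, 29718, -1))
print(d61)   # (226153980, 1766319049, 1)
```

The last line reproduces the pair quoted above for $D = 61$, which illustrates why a composition law is so useful: a comparatively small auxiliary solution leads directly to the large one.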

The equations $ay - bx = c$ and $Dx^2 + 1 = y^2$ are important equations in 
modern mathematics. But the Indian works on such indeterminate equations during 
the 5th-12th centuries were too advanced to be appreciated or noticed by Arab and 
Persian scholars and did not get transmitted to Europe during the medieval period. 
For instance, when Fyzi translated Bhaskara II’s treatise Lilavati⁹ into Persian, he 
omitted the portion on indeterminate equations. 

We now narrate some of the landmarks in the history of the two indeterminate 
equations $ay - bx = c$ and $Dx^2 + 1 = y^2$ in post-Renaissance Europe. 


Indeterminate Analysis in Modern Europe 


Problems in indeterminate equations were taken up in Europe during the 17th 
century. These problems fascinated some of the best European mathematicians of 
the 17th-18th centuries who not only rediscovered solutions to the problems but also 
developed general theories and techniques in this connection. 

The integer solution of the linear Diophantine equation was described in Europe 
for the first time by C.G. Bachet de Méziriac (1581-1638) more than eleven centuries 
after Aryabhata. In the first edition (1612 CE) of his book Problèmes plaisants et 
délectables qui se font par les nombres¹⁰, Bachet stated that if a and b are relatively 
prime integers, then one can find a least integral multiple of a which exceeds an 
integer multiple of b by a given integer c. The proof was given in the second edition 
of the book (1624). Dickson mentions that Bachet “employed notations for 18 
quantities, making it difficult to hold in mind the relations between them and so 
obtain a true insight into his correct process" ([Di, p 44]). Bachet also mentioned 


⁸Contrast it with the minimum solution of $60x^2 + 1 = y^2$: it is $x = 4$, $y = 31$. 

⁹A popular text on arithmetic and mensuration with some algebra. It was translated by Fyzi from 
Sanskrit into Persian, at the behest of Emperor Akbar, in 1587 CE. 

¹⁰Pleasant and delectable problems to be solved by numbers. Bachet had been interested in mathematical 
recreations and puzzles. This book is a collection of such puzzles. In 1621, Bachet published 
a bilingual edition (Greek original with Latin translation) of Diophantus' Arithmetica with extensive 
commentary. This work was largely responsible for the resurgence of number theory in Europe. Weil 
remarks ([We2, p 33]): “... his is the merit of having provided his successors and notably Fermat with 
a reliable text of Diophantus along with a mathematically sound translation and commentary.” 

his solution in his edition (1621) of Diophantus’ Arithmetica emphasising that the 
method is his own ([We2, p 7]). After Bachet’s work, several mathematicians in 
Europe continued to discuss various approaches to the equation $ay - bx = c$. Some 
of the early names include J. Kersey (1673), M. Rolle (1690), T.F. de Lagny (1697), 
L. Euler (1735), N. Saunderson (1740), W. Emerson (1764), J.L. Lagrange (1767), 
J. Bernoulli (1772) among many others; see [Di, Ch II]. As late as 1798 (nearly 
two centuries after Bachet), the great Legendre wrote in the preface of his Theory 
of Numbers (quoted in translation in [G, p 3]): 


Bachet,..., solved the indeterminate equation of the first degree by a general and very 
ingenious method. 


Pierre de Fermat (1601-65), who is regarded as the father of Modern Number 
Theory, tried to create enthusiasm for the subject among his contemporary 
mathematicians like John Wallis (1616-1703) and Lord William Brouncker (1620-84) 
of England, B. Frenicle de Bessy (c. 1612-75) of France and F. van Schooten¹¹ 
(1615-60) of the Netherlands. Fermat, who had carefully read Bachet's version of 
Diophantus, laid special emphasis on methods for finding integer solutions of 
equations as distinct from the easier, though important, question of finding rational 
solutions that occurs in Diophantus. One of the problems through which Fermat 
chose to highlight the beauty and intricacy of number theory was the problem of 
finding integer solutions to $Dx^2 + 1 = y^2$, a problem explored by Brahmagupta 
in 628 CE and solved by an Indian algebraist prior to the 11th century. Among 
the two specific examples that Fermat proposed to Frenicle in 1657 CE was the 
case $D = 61$ which occurs in Bhaskara II's Bijaganita (1150 CE).¹² To get some 
feel for the intensity of Fermat's involvement with number-theoretic methods, 
especially the equation $Dx^2 + 1 = y^2$, we may cite excerpts from his letter (1657) 
presenting his challenge to the English mathematicians (translation by Heath; quoted 
in [G, p 17]): 


There is hardly anyone who proposes purely arithmetical questions, hardly anyone who understands 
them. Is this due to the fact that up to now arithmetic¹³ has been treated geometrically 
rather than arithmetically? This has indeed been the case both in ancient and modern works; 
even Diophantus is an instance. For, although he has freed himself from geometry a little more 
than others have in that he confines his analysis to the consideration of rational numbers, . . . 


Now arithmetic has, so to speak, a special domain of its own, the theory of integral numbers. 
This was only lightly touched upon by Euclid in his Elements, and was not sufficiently 


¹¹A Dutch mathematician who is most famous for popularising the Cartesian geometry of Descartes. 

¹²The other case proposed by Fermat was $D = 109$. The smallest solution of $109x^2 + 1 = y^2$ is 
given by $x = 15140424455100$, $y = 158070671986249$. 

¹³Here “arithmetic” refers to “higher arithmetic”, i.e., number theory, the study of abstract properties 
of numbers. In Greece, usual computational arithmetic was called “logistic” while the term “arithmetic” 
meant number theory. 

studied by those who followed him ... ; arithmeticians have therefore now to develop it or 
restore it. 


To arithmeticians therefore, by way of lighting up the road to be followed, I propose the 
following theorem to be proved or problem to be solved. If they succeed in discovering the 
proof or solution, they will admit that questions of this kind are not inferior to the more 
celebrated questions in geometry in respect of beauty, difficulty or method of proof. 


Given any number whatever which is not a square, there are also given an infinite number of 
squares such that, if the square is multiplied into the given number and unity is added to the 
product, the result is a square. 


Regarding Fermat's challenge, André Weil remarked ([We2, p 81-82]): 


What would have been Fermat's astonishment if some missionary, just back from India, had 
told him that his problem had been successfully tackled there by native mathematicians almost 
six centuries earlier! 


A general method for solving Fermat's problem was discovered by Brouncker in 
1657.¹⁴ During 1657-58, there had been an exchange of letters among mathematicians 
like Frenicle, Brouncker and Wallis who had taken interest in Fermat's problem. 
These letters were published by Wallis in Commercium Epistolicum (1658). 
In 1685, Wallis published his monumental work A treatise of algebra both historical 
and practical (in short, Algebra) comprising 100 chapters. Chapter 98 of this 
treatise was devoted to Fermat's problem and was based on the correspondence 
published in Commercium Epistolicum. Brouncker's method for solving the equation 
$Dx^2 + 1 = y^2$ was described in this chapter in a standard form. An enlarged second 
edition of the treatise Algebra was published as the second volume of Wallis' Opera 
Mathematica (1693). 

Wallis had attempted to prove that $Dx^2 + 1 = y^2$ always has a positive integral 
solution, but used an incorrect result ([Di, p 354]). Fermat had asserted in his 
correspondence of 1659 that he had proved by his method of “descent” that the 
equation $Dx^2 + 1 = y^2$ has infinitely many integer solutions (when $D$ is a positive 
integer which is not a perfect square). The proof has not been found in any of his 
writings. 

Fermat's problem was again taken up in the next century by the two greatest 
figures of 18th century mathematics: L. Euler (1707-83) and J.L. Lagrange (1736-1813). 
Euler became interested in the problem from around 1730 and Lagrange took 
it up around 1768. They developed the theory of continued fractions, investigated 
the problem in the framework of this theory and established that if $D$ is not a perfect 
square, then $\sqrt{D}$ has an infinite but periodic continued fraction expansion, and all 


¹⁴In response to a challenge posed by Frenicle, Brouncker was able to use his method to find the 
(smallest) solution $x = 1819380158564160$, $y = 32188120829134849$ to the equation $313x^2 + 1 = y^2$ 
which he claimed had taken him only “an hour or two” ([OR]). 



solutions $(p, q)$ to the equation $Dx^2 + 1 = y^2$ are given by certain convergents 
$p/q$ of the expansion.¹⁵ Moreover, there exists a positive integral solution $(x_1, y_1)$ 
of $Dx^2 + 1 = y^2$ (called the fundamental solution) such that all positive integral 
solutions are given by $(x_n, y_n)$ where $x_n$, $y_n$ are defined by the relation $y_n + \sqrt{D}\,x_n = 
(y_1 + \sqrt{D}\,x_1)^n$. While the initial discoveries were made by Euler, it was Lagrange 
who first published formal proofs of all these results during 1768-69. He included 
them in his Additions to Euler's Elements of algebra (composed in 1771, published 
in 1773). Brouncker's method could be re-interpreted in the language of continued 
fractions; in that presentation, it becomes equivalent to Euler's method. The precise 
historical details are given in [We2, p 183; 229-233]. 
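
The two facts stated in this paragraph, that the fundamental solution appears among the convergents of $\sqrt{D}$ and that every other positive solution is a power of it, can be checked numerically. The sketch below uses the textbook recurrence for the continued fraction of a quadratic irrational; it is a modern illustration, not a transcription of Euler's or Lagrange's presentation, and the function names are hypothetical.

```python
from math import isqrt


def fundamental_solution(D):
    """Smallest positive (x, y) with D*x**2 + 1 == y**2, via convergents of sqrt(D)."""
    a0 = isqrt(D)
    if a0 * a0 == D:
        raise ValueError("D must not be a perfect square")
    m, d, a = 0, 1, a0          # state of the continued fraction expansion
    p_prev, p = 1, a0           # numerators of successive convergents
    q_prev, q = 0, 1            # denominators of successive convergents
    while p * p - D * q * q != 1:
        m = d * a - m
        d = (D - m * m) // d
        a = (a0 + m) // d       # next partial quotient
        p_prev, p = p, a * p + p_prev
        q_prev, q = q, a * q + q_prev
    return q, p


def nth_solution(D, n):
    """(x_n, y_n) defined by y_n + sqrt(D)*x_n = (y_1 + sqrt(D)*x_1)**n, for n >= 1."""
    x1, y1 = fundamental_solution(D)
    x, y = x1, y1
    for _ in range(n - 1):
        # Multiply (y + sqrt(D)*x) by (y1 + sqrt(D)*x1) and collect terms.
        x, y = x1 * y + x * y1, D * x1 * x + y1 * y
    return x, y


print(fundamental_solution(61))   # (226153980, 1766319049)
print(nth_solution(2, 3))         # (70, 99)
```

Python integers have arbitrary precision, so the same function also handles cases such as $D = 109$ and $D = 313$ mentioned in the footnotes.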


The Label “Pell’s Equation” 


Selberg mentioned in an interview¹⁶ that André Weil once said that if something 
in mathematics gets attached to the name of a person, then the person in question 
usually has very little to do with it. The equation $Dx^2 + 1 = y^2$ was attributed to 
the English mathematician John Pell (c. 1611-85) by Euler although there is no 
evidence that John Pell had seriously investigated the equation. Euler’s confusion 
could have been created through a cursory reading of Wallis’ Algebra, a large part of 
which was devoted to the work of five English mathematicians: Oughtred, Harriot, 
Pell, Newton and Wallis himself. In any case, the name “Pell’s equation” stuck. As 
Weil put it ([We2, p 174]): 


Pell's name occurs frequently in Wallis's Algebra, but never in connection with the equation 
$x^2 - Ny^2 = 1$ to which his name, because of Euler's mistaken attribution, has remained 
attached; since its traditional designation as “Pell's equation” is unambiguous and convenient, 
we will go on using it, even though it is historically wrong. 


As suggested by R. Sridharan ([Sr2, p 17]), the equation should be called “Brah- 
magupta equation” as a tribute to the genius who first made a contribution to the 
problem, a thousand years before the time of Fermat and Pell. 


Some Ancient Problems Involving $Dx^2 + 1 = y^2$ 


Some of the rational approximations to $\sqrt{D}$, which appear in (late Vedic) Indian 
and Greek texts, implicitly involve equations of the type $Dx^2 + 1 = y^2$. In the 


¹⁵A well-motivated student-friendly account of the theory is presented in chapter 4 of [G]. Students 
may also look at [BrC, Chapters XXIV and XXXIII] for an organised presentation of the theory of 
continued fractions with application to the equation $Dx^2 + 1 = y^2$. 

¹⁶Bulletin AMS 45(4), October 2008, p 621. 

Sulba-sutras (c. 800 BCE), the most ancient extant mathematics treatises, 7/5, 17/12 and 
577/408 have been used as approximations for $\sqrt{2}$ ([D2, p 202]; [Di, p 341]; [Sar, 
p 18]). These three fractions may be interpreted as arising out of solutions of 
$2x^2 \pm 1 = y^2$; for $2 \times 5^2 - 1 = 7^2$, $2 \times 12^2 + 1 = 17^2$ and $2 \times 408^2 + 1 = 577^2$. 
In fact, they are respectively the third, fourth and eighth convergents of 
the simple continued fraction expansion of $\sqrt{2}$. We shall revisit these fractions in 
section 7. 

Archimedes (287-212 BCE) gave the approximations 265/153 and 1351/780 for $\sqrt{3}$. In 
his commentary on the work of Archimedes, Eutocius (c. 480-540 CE) mentions 
the relations $265^2 - 3 \times 153^2 = -2$ and $1351^2 - 3 \times 780^2 = 1$ as a verification of 
the validity of the approximations ([We2, p 16]). 
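
The numerical claims in the two preceding paragraphs are easy to confirm by machine. The short sketch below assumes only the standard fact that the simple continued fraction of $\sqrt{2}$ is $[1; 2, 2, 2, \ldots]$; the function name `sqrt2_convergents` is hypothetical.

```python
from fractions import Fraction


def sqrt2_convergents(count):
    """First `count` convergents of sqrt(2) = [1; 2, 2, 2, ...]."""
    p_prev, p = 1, 1
    q_prev, q = 0, 1
    result = [Fraction(p, q)]
    while len(result) < count:
        # Every partial quotient after the first equals 2.
        p_prev, p = p, 2 * p + p_prev
        q_prev, q = q, 2 * q + q_prev
        result.append(Fraction(p, q))
    return result


conv = sqrt2_convergents(8)
for position in (3, 4, 8):
    f = conv[position - 1]
    y, x = f.numerator, f.denominator
    print(position, f, y * y - 2 * x * x)
# 3 7/5 -1
# 4 17/12 1
# 8 577/408 1

# The relations quoted by Eutocius for Archimedes' bounds on sqrt(3).
for y, x in ((265, 153), (1351, 780)):
    print(f"{y}/{x}", y * y - 3 * x * x)
# 265/153 -2
# 1351/780 1
```

The third column shows the small value of $y^2 - Dx^2$ attached to each fraction, which is the sense in which these approximations implicitly involve equations of the type $Dx^2 + 1 = y^2$.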

In 1773, G.E. Lessing discovered an ancient Greek epigram which states a
problem (known as the “Cattle Problem”) which is believed to have been
communicated by Archimedes to Eratosthenes for the mathematicians of Alexandria.
In this problem one is required to find eight integers (the numbers of bulls and cows
of each of four colours) satisfying nine conditions, some linear and some quadratic. After some
algebraic manipulations, the cattle problem reduces to the problem of finding a
positive integral solution to $Dx^2 + 1 = y^2$ where $D = 4729494$. There is no
evidence that Archimedes had made this connection ([OR]) and it is thought
extremely unlikely that either Archimedes or any of his contemporaries had
actually solved the cattle problem. The problem was first solved by A. Amthor (1880
CE); the smallest value of one of the variables in the cattle problem is a number
having 206545 digits (see [Di, p 342-345], [L] and [Wi, p 400-402] for further
details).
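
To give a feel for the size of the numbers involved, the following sketch finds the least positive solution of $Dx^2 + 1 = y^2$ by the later continued-fraction method, using Python's arbitrary-precision integers. The function name `pell_fundamental` is hypothetical, and the sketch treats only the reduced equation: the full cattle problem imposes a further divisibility condition on the solution, which is not handled here.

```python
from math import isqrt


def pell_fundamental(D):
    """Return the least positive (x, y) with D*x*x + 1 == y*y,
    found from the continued fraction expansion of sqrt(D)."""
    a0 = isqrt(D)
    if D <= 0 or a0 * a0 == D:
        raise ValueError("D must be a positive non-square integer")
    m, d, a = 0, 1, a0           # state of the expansion of sqrt(D)
    p_prev, p = 1, a0            # numerators of successive convergents
    q_prev, q = 0, 1             # denominators of successive convergents
    while p * p - D * q * q != 1:
        # standard recurrence for the partial quotients of sqrt(D)
        m = d * a - m
        d = (D - m * m) // d
        a = (a0 + m) // d
        p_prev, p = p, a * p + p_prev
        q_prev, q = q, a * q + q_prev
    return q, p


cattle_x, cattle_y = pell_fundamental(4729494)
print(len(str(cattle_x)), len(str(cattle_y)))  # digit counts of the least solution
```

The least solution of the reduced equation has only a few dozen digits; the far larger numbers reported for the cattle problem arise only after the additional condition is imposed.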


Cakravala Revisited


The Indian cakravāla needs fewer steps than the later methods of Brouncker and
Euler; however, a formal justification for the cakravāla method is less elegant. Some
mathematicians therefore prefer to describe the Brouncker-Euler algorithm when
writing on the cakravāla for students (see, for instance, [Vr, p 28-35]).

A formal proof that the cakravāla method will always work (that is, terminate after
a finite number of steps) was given by Krishnaswamy Ayyangar in 1929 ([Kr1]).
Ayyangar related the method with the theory of a half-regular continued fraction¹⁷
which he developed using only the kind of elementary mathematics which would
have been known in India by the 5th century CE.

Another approach was given by C.O. Selenius (1960) who developed a generalised
continued fraction expansion, which he called “ideal expansion”, used it to


17 See [Kr1], [Kr2], [Kr3] for Ayyangar's work and [MRW] for recent applications of Ayyangar's
theory of “nearest square continued fraction”.
give an algorithm for solving the equation $Dx^2 + 1 = y^2$ in which the number of
steps is minimised, and showed that his “ideal expansion” method is equivalent to
the Indian cakravāla ([Sel]).¹⁸
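
For readers who wish to experiment, here is a minimal sketch of the cyclic process in its usual modern formulation: keep a triple $(x, y, k)$ with $y^2 - Dx^2 = k$, and at each step choose $m$ so that $y + xm$ is divisible by $|k|$ and $|m^2 - D|$ is as small as possible. The function name `cakravala`, the choice of starting triple and the direct search for $m$ are our own assumptions, not details taken from the ancient texts.

```python
from math import isqrt


def cakravala(D):
    """Return (x, y) with D*x*x + 1 == y*y for a positive non-square D."""
    if D <= 0:
        raise ValueError("D must be a positive non-square integer")
    r = isqrt(D)
    if r * r == D:
        raise ValueError("D must be a positive non-square integer")
    # Start with x = 1 and the integer y nearest to sqrt(D),
    # so that y^2 - D*x^2 = k with |k| small.
    y = r if D - r * r <= (r + 1) ** 2 - D else r + 1
    x = 1
    k = y * y - D
    while k != 1:
        ak = abs(k)
        # Residue class of m for which (y + x*m) is divisible by |k|.
        m0 = next(m for m in range(ak) if (y + x * m) % ak == 0)
        # Within that class, |m^2 - D| is least at one of the two
        # members lying on either side of sqrt(D).
        m_low = r - ((r - m0) % ak)
        m = min((m_low, m_low + ak), key=lambda t: abs(t * t - D))
        # Compose (x, y, k) with the auxiliary triple (1, m, m^2 - D)
        # and divide out the common factor |k|.
        x, y, k = (y + x * m) // ak, (y * m + D * x) // ak, (m * m - D) // k
    return x, y


print(cakravala(67))
```

For $D = 67$ this process passes through the intermediate values $k = -3, 6, -7, -2, -7, 6, -3$ before reaching $k = 1$.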


Legacy of “Pell’s Equation”


The study of indeterminate equations, especially the study of the so-called “Pell’s 
equation”, played an important role in the evolution of classical algebra in ancient 
India and later in modern Europe.¹⁹

Pell’s equation has had applications throughout history. Large integer solutions of
$Dx^2 + 1 = y^2$ have been used in ancient times to yield good rational approximations
to $\sqrt{D}$ (see section 7). The solutions of the equation yield units in the domain of
integers of the quadratic field $\mathbb{Q}(\sqrt{D})$. The equation is also closely related to the
study of binary quadratic forms.
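
The link between large solutions and good approximations can be illustrated with Brahmagupta's composition rule (the bhāvanā): from two solutions $(x_1, y_1)$ and $(x_2, y_2)$ of $Dx^2 + 1 = y^2$ one obtains the new solution $(x_1 y_2 + x_2 y_1,\; y_1 y_2 + D x_1 x_2)$. The sketch below, whose function name `bhavana` is our own, repeatedly composes a solution with itself for $D = 2$.

```python
from fractions import Fraction


def bhavana(D, s, t):
    """Compose two solutions s = (x1, y1) and t = (x2, y2)
    of D*x^2 + 1 = y^2 into a new solution of the same equation."""
    x1, y1 = s
    x2, y2 = t
    return (x1 * y2 + x2 * y1, y1 * y2 + D * x1 * x2)


D = 2
solution = (2, 3)                 # 2 * 2^2 + 1 = 3^2
approximations = []
for _ in range(3):
    x, y = solution
    approximations.append(Fraction(y, x))   # y/x approximates sqrt(D)
    solution = bhavana(D, solution, solution)

for value in approximations:
    # the error of (y/x)^2 as an approximation to D is exactly 1/x^2
    print(value, value * value - D)
```

Since $y^2 - Dx^2 = 1$ gives $(y/x)^2 - D = 1/x^2$, each composition sharply reduces the error; starting from $(2, 3)$ the fractions produced are $3/2$, $17/12$ and $577/408$.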

The solution of Pell’s equation is the main step in the solution of the general 
quadratic Diophantine equation in two variables. It also played a role in the solution 
(1970) of Hilbert’s 10th problem on the non-existence of an algorithm for solving 
arbitrary Diophantine equations. 

Pell’s equation continues to fascinate mathematicians even today. Research
papers and articles are being published on it in various contexts; two of the recent
papers ([Ha], [MRW]) are listed in the bibliography. An extensive bibliography on
the equation and on methods for solving it can be found in the 2002 paper ([Wi]) of
Hugh Williams. The algorithmic efficiency of various methods is discussed in the
article of H.W. Lenstra ([L]). An expository book ([Br]), exclusively on the topic,
has been published recently (2003). The author E.J. Barbeau remarks in the Preface:


Pell’s equation seems to be an ideal topic to lead college students, as well as some talented
and motivated high school students, to a better appreciation of the power of mathematical
technique.²⁰


Efficient generation of solutions of the equation is a very active area of research in
algorithmic number theory and computer science. While no polynomial-time classical
algorithm for solving Pell’s equation is known, S. Hallgren has exhibited a


18 Selenius presented this work in a session on “Algebra and Theory of Numbers” at the International
Congress of Mathematicians (1962). His abstract begins with a reference to Brahmagupta and Bhaskara
(“Abstracts of Short Communications” of the ICM, Stockholm, p 51).

19 I.G. Bashmakova has emphasised that solving indeterminate equations was a significant part of the
development of algebra in Europe. See her article Diophantine equations and the evolution of algebra
in the Proceedings of the International Congress of Mathematicians (ICM), Berkeley, California (1986).

20 This is also the view of the present author who feels that the Indian works are particularly suited
for this purpose (see [Du1, p 86-89]).
polynomial-time quantum algorithm for the equation ([Ha]). As Lenstra remarks
([L, p 192]):


The last word on algorithms for solving Pell's equation has not been spoken yet. 


2. Texts on Algebra in the Classical Age 


As mentioned earlier, the subject of algebra was developed during the Classical Age,
consciously, as a distinct and important branch of mathematics. The text Aryabhatiya
(499 CE) already contains algebraic ideas, like a term gulika (“shot”; possibly
“coloured shot”) for an unknown quantity and algebraic results like solutions of
linear and quadratic equations, formulae for arithmetic progression and other sums
of finite series and, of course, the integer solutions of linear Diophantine equations.
Bhaskara I (c. 600 CE) discusses the kuttaka (the linear Diophantine equation) in
detail in the first chapter (verses 41-52) of his astronomy treatise Mahā-Bhāskarīya
([Shu2, p 29-46]), apart from the discussions, with examples, in his commentary
Aryabhatiya-bhasya ([Ke, p 128-166]).

It is however the Brahma Sphuta Siddhanta (628 CE) of Brahmagupta which laid
a firm foundation for the science of algebra in India. This is the first known ancient
Indian text containing a separate chapter on algebra. Chapter 18 of this voluminous
treatise (comprising over 1000 verses in 24 chapters) is titled Kuttakadhyayah
(= Kuttaka (algebra) + adhyāyah (chapter)). Towards the beginning and the end
of this chapter, Brahmagupta emphasises the importance (rather indispensability)
of the subject and its charm. He systematically expounds the basics: describes
the use of symbols for unknowns, defines zero as an integer in algebra, introduces
negative numbers and prescribes the rules of arithmetic operations in the
enlarged number system (including negative numbers and zero), discusses the
arithmetic of surds, explains the principles of formation and manoeuvring of
equations (plan of writing, arrangement, clearances, etc.), gives methods for solving
quadratic equations and simultaneous linear equations in several unknowns
(anticipating the Gaussian method of elimination), etc. A substantial portion of
the chapter on algebra discusses indeterminate equations of the first and second
degree.

The impact of Brahmagupta on algebra can be seen from the fact that separate
monographs on the subject (usually titled Bijaganita²¹) were brought out
subsequently by prominent mathematicians like Sridhara (750 CE), Padmanabha
(date unknown but before Bhaskara II), Sripati (1039 CE), Bhaskara II (1150 CE),


21 The subject of algebra gradually acquired the current Sanskrit term bijaganita. Bija means “seed”,
“root”, “element”, as well as “analysis”; thus bijaganita literally means the “mathematics with elements
(letters)” as well as “analytical mathematics”.
Narayana (1350 CE) and later mathematicians. Unfortunately, the specialised
algebra texts of Sridhara, Padmanabha and Sripati, the predecessors of the great
Bhaskara II, have not been found.²²

Apart from algebra texts, the general texts on mathematics, like the Ganita-sāra-sangraha
(850 CE) of Mahavira, also contain several results on topics
in algebra like quadratic equations, summations of finite series (like arithmetic
progression (AP) and geometric progression (GP)), permutations and combinations,
and sometimes even the kuttaka. Some of the astronomer-mathematicians wrote
separate chapters on algebra in their general texts. Chapter 14 of the astronomy
treatise Siddhantasekhara of Sripati, entitled avyakta-ganita, is on algebra.

Discussions on the relatively advanced or specialised topics in algebra, like
solutions of quadratic indeterminate equations, were usually confined to the Bijaganita
texts. The Bijaganita of Bhaskara II contains, apart from basic results of algebra,
a lucid exposition of the kuttaka, bhāvanā and the cakravāla, and mentions
several other indeterminate equations. Bhaskara II attributes the cakravāla to ancient
authors without specifying any name. At the end of his treatise, Bhaskara II makes a
general acknowledgement of the prolific algebra works of Brahmagupta²³, Sridhara
and Padmanabha as his chief sources. There is therefore a strong possibility that the
lost treatises of Sridhara and/or Padmanabha too contained an account of the cakravāla
algorithm. The history of cakravāla became more mysterious with the discovery
(in 1954) of a work dated 1073 CE which quotes verses from a preceding algebraist
Jayadeva describing the cakravāla algorithm (along with the bhāvanā). Curiously,
Jayadeva is not mentioned by Bhaskara II.

It is remarkable how early the leading Indian mathematicians had realised the
significance of algebra and how strongly they asserted and established the importance
of their newly-founded discipline. Brahmagupta declares at the outset of his
Kuttakadhyayah (chapter on algebra) that algebra is indispensable for solving most
problems, gives a list of topics in algebra, and declares that one who has mastered
those topics will be venerated as an ācārya among the learned. And, towards the
end of the chapter, he celebrates the delight and glory of the subject through the
words ([Sha, Ch 18; verses 99-100]; [Du1, p 104]):


These questions are stated simply for delight. One may devise a thousand others, or may 
resolve the problems proposed by others, by the rules given here. 


As the sun eclipses the stars by his brilliance, so will an expert eclipse the glory of other 
astronomers in assemblies of people by the recital of algebraic problems, and still more by 
their solution. 


22 Bhaskara II refers to the extensive algebra works of Sridhara and Padmanabha ([Ab, p 52]; [P, p
196]); Munisvara (c. 1608 CE) mentions the Bijaganita of Sripati ([Sih, p 26]).

23 The verse of Bhaskara II ([Ab, p 52]; [P, p 196]) actually mentions Brahmi. It is generally presumed
by historians that Bhaskara II was referring to Brahmagupta.
In order to emphasise the importance, power, and profundity of algebra, the
poet-mathematician Bhaskaracarya (Bhaskara II) begins his treatise Bijaganita
with an Invocation involving an interesting “pun” on the words Samkhyāh (the
Samkhya philosophers as well as the experts in samkhya, the science of numbers),
Satpurusa (the Self-Existent Being as well as the wise mathematician), bija
(root/cause as well as algebra) and vyakta (the manifested universe as well as the
revelation of an unknown quantity). Thus, through the opening verse, Bhaskaracarya
venerates the Unmanifested, the Self-Existent Being of the Samkhya philosophy,
who is the originator of intelligence and the primal Cause of the known
or manifested universe; and, through the very same words, Bhaskaracarya pays
tribute to the wise mathematician who, using algebra, solves a problem (i.e.,
reveals or manifests an unknown quantity). The importance of algebra is reiterated
at the end of Bijaganita. Bhaskaracarya remarks that algebra is the essence of all
mathematics, is full of virtues and free from defects; and that the cultivation of
algebra will sharpen the intellect of children. He concludes with the exhortation
“patha patha” (Learn it, learn it) for the development of intelligence ([Ab, p 7; 53];
[P, p 4; 198]).

Another algebraist Narayana (1350) exalts the subject with the following tribute 
([DS2, p 5]): 


As out of Him is derived the entire universe, visible and endless, so out of algebra follows the 
whole of arithmetic with its endless varieties (of rules). Therefore, I always make obeisance 
to Siva and also to (avyakta-) ganita (algebra). 


3. Ancient Indian Algebra in History of Mathematics 


In this section, we give a brief account of the discovery of some of the above works 
by modern historians and of expositions on them in some of the important early 
texts on history of mathematics. 

English translations of ancient Indian mathematics texts, especially of
Bhaskara II, began appearing from the early 19th century. Bhaskara’s Bijaganita (algebra)
was translated from Persian²⁴ into English by E. Strachey under the title Bija
Ganita or the Algebra of the Hindus (London, 1813). The Lilavati of Bhaskara was
translated from the original Sanskrit by John Taylor (Bombay, 1816).

The work during this period which, perhaps, had the greatest impact in generating
an awareness about ancient Indian mathematics was the translation by H.T. Colebrooke
titled Algebra, with Arithmetic and Mensuration, from the Sanscrit of Brahmegupta
and Bhascara (London, 1817). Colebrooke translated the texts Lilavati and
Bijaganita of Bhaskara II and the chapters Ganitadhyayah (Ch 12; arithmetic and
mensuration) and Kuttakadhyayah (Ch 18; algebra) of Brahma Sphuta Siddhanta


24 Based on the Persian translation of 1634 CE by Ata Alla Rushdi.
of Brahmagupta, along with selections from the reputed commentary on Brahma
Sphuta Siddhanta titled Vasanabhasya (c. 860 CE) by Caturveda Prthūdakasvamin.

Colebrooke also wrote a long preface where he discussed some of the distinguishing
features of the texts. This preface was reproduced in Colebrooke’s Miscellaneous
Essays (1837) and its contents thereby got the attention of a wide community of
serious scholars on India’s cultural history. One of the features of Indian algebra
that Colebrooke highlighted was ([Co2, p 438]):


General methods for the solutions of indeterminate problems of first and second degrees, 
in which they went far, indeed, beyond Diophantus, and anticipated discoveries of modern 
algebraists. 


For some time during the 17th-19th centuries, Aryabhatiya had been unavailable
to scholars of Indology, though its existence and contents had been known through
other available texts. In 1817, Colebrooke laments ([Co2, p 422]):


A long and diligent research in various parts of India, has, however, failed of recovering any
part of the Padmanabha vija (or Algebra of Padmanabha²⁵), and of the algebraic and other
works of Aryabhatta.


Although a few manuscripts of Aryabhatiya were available in parts of Kerala,
Indologists did not come across them for several decades till Bhau Daji²⁶ discovered
a manuscript in 1864. The text, with the commentary by Parameśvara, was then
published by H. Kern in 1874 at Leiden (Netherlands). Subsequently, there have
been several English translations of Aryabhatiya with commentaries by historians
of science like P.C. Sengupta from Calcutta (1927) and W.E. Clark from Chicago
(1930). In the 1500th birth anniversary year of Aryabhata (b. 476 CE), the
Indian National Science Academy (INSA) published the Aryabhatiya with English
translation and notes by K.S. Shukla and K.V. Sarma ([ShuS]), along with the
commentaries of Bhaskara I and Somesvara (edited by K.S. Shukla) and the commentary
of Sūryadeva (edited by K.V. Sarma). Recently, A. Keller has prepared an English
translation of the portion of Bhaskara I's Aryabhatiya-bhasya which discusses the
mathematics chapter Ganita of Aryabhatiya ([Ke]).

The text Ganita-sara-sangraha (850 CE) of Mahavira was discovered in the early
20th century and published in 1912 with an English translation by M. Rangacharya.


25 The works of Padmanabha remain elusive.

26 A very successful and popular medical practitioner, Dr. Bhau Daji (1821-1874) made immense
contributions to Indological research: travelling throughout the country (visiting forts, temples and
caves) in search of ancient inscriptions; collecting a large number of manuscripts and a good number of
copper-plates and movable epigraphs; and publishing several papers on the history of Indian numerals,
chronology, epigraphy and literature.
The works of Bhaskara I were discovered by B. Datta²⁷ in 1930 and published by
K.S. Shukla during 1960-76 ([Shu2], [Shu3], [Shu4]).

In 1954, K.S. Shukla discovered the text Sundari (1073 CE) of Udayadivakara,
a commentary on the Laghu-Bhāskarīya of Bhaskara I, and found that it contains
quotations from a hitherto unknown mathematician called Jayadeva on the bhāvanā
and the cakravāla. He published the verses in [Shu1].

The book by Datta-Singh ([DS2]), published in 1938 and reprinted in 1962, 
remains the richest reference book on ancient Indian algebra. Much information 
on indeterminate analysis is recorded in the book. However there is not much 
discussion on the significance of the works from the viewpoint of modern algebra. 
The ideas of abstract modern algebra had not become commonplace among the 
general mathematical community in India in the 1920s and 30s when Datta did his 
pioneering work on history of mathematics. 

A few books published in the West in the early 20th century had incorporated facts
on ancient Indian mathematics with reasonable accuracy. L.E. Dickson's History 
of the Theory of Numbers Vol II (1920) contains considerable mention of Indian 
contributions to the study of indeterminate equations. In fact, in the Preface, he 
makes a special mention of Brahmagupta's work ([Di, p. xi]): 


It is a remarkable fact that the Hindu Brahmegupta²⁸ in the seventh century gave a tentative
method of solving $ax^2 + c = y^2$ in integers, which is a far more difficult problem than its
solution in rational numbers.


However, Dickson was not aware of the discovery of Aryabhatiya in 1864 and, like 
Colebrooke, refers to it as a lost treatise ([Di, p 41]). 

The book History of Mathematics by F. Cajori (1919) contains a short chapter 
"The Hindus" where there is a brief but admiring reference to Indian works on 
indeterminate equations ([Ca, p 94-98]):


... Indeterminate analysis was a subject to which the Hindu mind showed a happy adap- 
tation. We have seen that this very subject was a favorite with Diophantus, and that his 
ingenuity was almost inexhaustible in devising solutions for particular cases. But the glory 
of having invented general methods in this most subtle branch of mathematics belongs to 
the Indians. The Hindu indeterminate analysis differs from the Greek not only in method, 
but also in aim. The object of the former was to find all possible integral solutions. Greek 
analysis, on the other hand, demanded not necessarily integral, but simply rational answers. 


27 Bibhutibhusan Datta (1888-1958), later Swami Vidyaranya, is a pioneer among historians of
Indian mathematics. His numerous papers contain a wealth of information and insights for anyone
seriously interested in the history of ancient Indian mathematics; unfortunately most of them are not
easily accessible. A D.Sc. in Applied Mathematics, Datta possessed the rare combination of professional
mathematical experience with deep knowledge of Sanskrit literature. Apart from his books on history of
Indian mathematics ([D2], [DS1], [DS2]), he has also written a few books on Indian philosophy.

28 The effect of Colebrooke on Dickson can be seen in the spelling of Brahmagupta.
Diophantus was content with a single solution; the Hindus endeavored to find all solutions
possible. ... 

Remarkable is the Hindu solution of the quadratic equation $cy^2 = ax^2 + b$. With great
keenness of intellect they recognized in the special case $y^2 = ax^2 + 1$ a fundamental problem
in indeterminate quadratics. They solved it by the cyclic method. ...


Doubtless this “cyclic method” constitutes the greatest invention in the theory of numbers
before the time of Lagrange.²⁹ ...


... Unfortunately, some of the most brilliant results in indeterminate analysis, found in Hindu
works, reached Europe too late to exert the influence they would have exerted, had they come
two or three centuries earlier.


While several pioneering scholars like Colebrooke, Hankel and Thibaut attempted
objective presentations of different portions of ancient Indian mathematics,
there were other influential writers, steeped in Eurocentric views, who sought
to undermine it. The authors who were prejudiced in their portrayal of ancient
Indian mathematics did not even bother to be accurate or careful in conforming
to known facts. For instance, any work on the Pellian equation having some
sophistication was believed to be of Greek origin; and many authors who favoured
the hypothesis of Greek origin did not have clear ideas regarding the distinction
between Brahmagupta's brilliant but partial work and the subsequent complete
cakravāla algorithm (which is a strong indicator that the research was indigenous).
Two errors commonly occurred: there were some who had the impression that the
final cakravāla method also occurs in Brahmagupta's work; and there have been
writers who thought that Bhaskara II did not go beyond Brahmagupta's partial
work. We refrain from giving details in this article. While there has been a
recognition of the bias of certain writers, their warped views and erroneous remarks
have had an influence on several subsequent authors and have resulted in a distorted
representation of ancient Indian mathematics even by more objective writers. (See also
section 9.)

Eurocentrism declined with the passage of time. But in the subsequent period there
has not been as much dedicated scholarship on ancient Indian mathematics
as on, say, Greek or Babylonian mathematics. Moreover, well-edited
publications with English translations and commentaries of even the major ancient
Indian mathematical treatises are not always easily available. Let us consider the
current situation regarding some of the ancient texts relevant to the present article.
The INSA publication on Aryabhatiya ([ShuS]) has long been out-of-print. As far 
as the present author is aware, the original verses of Brahma Sphuta Siddhanta 


29 There seems to be an allusion to the remark on cakravala by H. Hankel: “It is beyond all praise;
it is certainly the finest thing achieved in the theory of numbers before Lagrange.” (Zur Geschichte der
Mathematik in Alterthum und Mittelalter, Leipzig, 1874, p 202.)
were last published in 1966 ([Sha]), with Hindi translation. Not all chapters (not
even all the mathematics chapters) of Brahmagupta’s magnum opus have English
translations. Colebrooke's partial translations ([Co1]), though valuable, are not
always lucid. Jayadeva's verses occur only in the 1954 paper ([Shu1]). The picture
is somewhat better with respect to the mathematics texts of Bhaskara II. A well-known
publisher has brought out an English edition, with notes, of Lilavati ([PNS]),
based on the Marathi work of N.H. Phadke. Two useful recent editions of Bijaganita
are mentioned in the bibliography ([Ab]; [P]); however, neither is free from errors.
Publications on ancient commentaries of the original treatises are even more scarce.
As M.D. Srinivas emphasises in [Srn], commentaries are no less important than
the original treatises for a proper comprehension of the history of ancient Indian
mathematics.

Results like bhāvanā and cakravāla have evoked deep appreciation from
mathematicians of high eminence, as can be glimpsed from the statements of Weil (sections
1, 5, 7) and Atiyah (section 9) or the tribute of Ojanguren (section 6). However, such
observations occur at scattered places (as incidental remarks in a sophisticated
book, in an isolated lecture, or in a highly specialised research monograph), far
removed from the attention of the general scholar. Going by some of the books on
history of mathematics in recent decades, it would appear that not all historians are
sufficiently aware of how highly the ancient Indian works are regarded by some of
the great mathematicians of modern times.³⁰


4. Long Division, GCD and Euclidean Algorithm 


Before commencing our discussion on the kuttaka, we shall make a brief mention of 
the Euclidean algorithm which is a key step in the solution of the linear indeterminate 
equation. We first discuss the antiquity of the method of "long division" in ancient 
India. 


Computational Arithmetic in Ancient India: Long Division 


Familiarity with the four fundamental operations of arithmetic is evident in Vedic 
literature like the Satapatha Brahmana, the Taittiriya Samhita and even the 


30 Many accounts, even by highly respected historians of mathematics, omit Indian contributions
altogether. To give two examples: The History of Mathematics: A Reader (1987) by J. Fauvel and J.
Gray has no representation of mathematics in India; the monograph The Beginnings and Evolution of
Algebra (2000) by I.G. Bashmakova and G.S. Smirnova has no mention of the algebra of post-Vedic
ancient India.
Rg-Veda. A passage from the ancient Satapatha Brahmana (quoted in [Sr2, p 5-6])
gives all divisors of 720. The Sulba-sūtras show clear use of arithmetic operations
including division; they even give examples of division by fractions. We however
do not know about the actual techniques used in the Vedic era.

Now, although there was a long tradition of computational mathematics in ancient
India, a major difficulty in the reconstruction of this history is that the post-Vedic
mathematical works preceding Aryabhatiya (499 CE) are lost.³¹ Aryabhata mentions
at the beginning (verse 1) of the mathematics chapter Ganita of Aryabhatiya
that he is recording ancient knowledge “honoured at Kusumapura”. His commentator
Bhaskara I makes an incidental reference to names like Maskari, Pūrana,
Mudgala, Pūtana, referring to them as Ācāryas (Masters). However nothing is
known to us about such revered mathematicians.³²

The gaps in our knowledge regarding the development of computational techniques
in India may be illustrated by the history of “division”. A variant of the
modern method of “long division” is described in the arithmetic treatises of
Sridhara (750 CE), Mahavira (850 CE) and later writers (see [DS1, p 150-154]
for details). But how old is their method of “long division”? Aryabhata's ingenious
algorithms for computing square roots and cube roots (which are again
slight variants of the present methods) involve the technique of long division.
However, neither Aryabhata (499 CE) nor Brahmagupta (628 CE) bothers to explain
any method of division separately. They take for granted the knowledge of
the long division method when they describe their algorithms for extraction of
square roots and cube roots. One can only conclude that the device had become
firmly established well before the 5th century and that the great Masters Aryabhata
and Brahmagupta considered the method to be too elementary for their respective
treatises.
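
For orientation, the digit-by-digit idea behind long division can be sketched as follows. This is the modern schoolbook procedure written in Python, with a function name (`long_division`) of our own; it is not a reconstruction of the layout used on the dust board or in the galley method.

```python
def long_division(dividend, divisor):
    """Divide digit by digit, as in schoolbook long division.
    Returns (quotient, remainder)."""
    if dividend < 0 or divisor <= 0:
        raise ValueError("expected dividend >= 0 and divisor > 0")
    quotient_digits = []
    remainder = 0
    for digit in str(dividend):
        # bring down the next digit of the dividend
        remainder = remainder * 10 + int(digit)
        # find the quotient digit (at most 9) by repeated subtraction
        q = 0
        while remainder >= divisor:
            remainder -= divisor
            q += 1
        quotient_digits.append(str(q))
    return int("".join(quotient_digits)), remainder


print(long_division(720, 16))
```

At every stage only the current partial remainder and one new digit are in play, which is why the procedure suits a medium on which used figures can be wiped out.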

We mention here that the operation of division was regarded as a difficult
operation by European scholars from ancient times till as late as the 15th century.
Luca Pacioli remarked in 1494 CE: “if a man can divide well, everything else is
easy, for all the rest is involved therein.” ([Sm, p 132]). Not much is known about
ancient European methods of division. The Indian version of long division was
transmitted to the Arabs and occurs in Arabic works from the 9th century onwards.
From the Arabs, the method went to Europe where it came to be known as the galley
(galea, batello) method (also called “scratch method”) which became very popular
in Europe during the 15th-18th centuries. The method was very convenient in the

31 Perhaps the original text of the Bakhshali manuscript could be anterior to Aryabhatiya. Its date is
controversial; some historians like B. Datta place it at around 200 CE.

32 Evidently they had composed voluminous mathematics texts; for Bhaskara I explains why
Aryabhata records only a “bit of mathematics” ([Sar, p 68-69]; [Shu4, p liv-lv]): “This has not been
composed as a treatise by ācāryas Maskari, Pūrana, and others, with even one lakh verses for each
[topic]. How can the Ācārya [Aryabhata] manage [to state all of] it within such a short treatise?”
earlier system where erasing of figures was easy and desirable.³³ The paper-and-ink
system was less conducive to it and eventually led to the modern arrangement,
a modification of the galley method.


Greatest Common Divisor (GCD) 


As the name suggests, the GCD of two positive integers is the largest integer which
divides both the numbers (without leaving any remainder). As we shall see in the next
section, the first major step in the kuttaka is the computation of the GCD of two
numbers. In the Aryabhatiya, the algorithm for computing GCD is alluded to in a
terse way. In fact, all expositors on kuttaka, even those who are generally more
elaborate, refer to the algorithm for computing GCD by repeated division without
much elaboration.
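
In modern terms, computing the GCD by repeated division can be sketched as below. The function name `gcd_by_division` is our own; the list of successive quotients is returned as well because such quotients are commonly what a kuttaka-style computation goes on to use, though that use is not shown here.

```python
def gcd_by_division(a, b):
    """Return (gcd, quotients) for positive integers a and b,
    dividing repeatedly and replacing (a, b) by (b, remainder)."""
    if a <= 0 or b <= 0:
        raise ValueError("a and b must be positive integers")
    quotients = []
    while b != 0:
        q, r = divmod(a, b)   # a = b*q + r with 0 <= r < b
        quotients.append(q)
        a, b = b, r
    return a, quotients


print(gcd_by_division(720, 168))
```

For 720 and 168 the divisions are $720 = 4 \times 168 + 48$, $168 = 3 \times 48 + 24$ and $48 = 2 \times 24 + 0$, so the last non-zero remainder, 24, is the GCD.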

One can only conclude that, before the time of Aryabhata, ancient Indians had 
become familiar with the technique of computing GCD, presumably by long divi- 
sion. As is the case with most concepts and results occurring in Aryabhatiya, we lack 
the precise knowledge regarding the genesis of the concept of GCD in ancient India. 

Amidst all literature that is available to us, Euclid's Elements is the oldest text 
which describes an algorithm for computing the GCD. However, there is no concrete 
evidence that this algorithm was transmitted to Indian mathematicians. As one does 
not see any influence of other results from the Elements in the mathematics of 
Aryabhata and his successors (see the discussion in the second paragraph of (iv) 
of section 9), it is more likely that a division method for computing GCD evolved 
independently in ancient India. 


Reciprocal Subtraction


Let $a$ and $b$ be the lengths of two line segments (that is, two “real numbers” in modern
terminology). If $a > b$, then one subtracts $b$ from $a$, then $b$ from $a - b$ if $a - b > b$,
then $b$ from $a - 2b$ if $a - 2b$ still exceeds $b$, and so on, till one reaches a stage, say
after $q_1$ subtractions, when the remainder $r_1 := a - q_1 b$ does not exceed $b$. Now, if
$b > r_1$, then, as before, one keeps on subtracting $r_1$ from $b$ as often as possible, say
$q_2$ times, till one reaches the remainder $r_2 = b - q_2 r_1$ which is $\le r_1$. The subtractive
process is then continued with the lengths $r_1$ and $r_2$ to get $q_3$, $r_3$, then with $r_2$ and $r_3$,
and so on. This process was called Antanairesis (reciprocal subtraction) in ancient
Greek mathematics. The sequence (finite or infinite) of quantities $(q_1, q_2, \ldots)$ was

33 In ancient India, calculations (pātiganita) were usually performed on sand spread on a board called
pati. In this format, the figures being big, not many lines of figures could be written on a board; but it
was easy to erase figures. Computational techniques, therefore, were planned in such a way that fewer
figures would be used at a time, and figures which are no longer required could be erased. See [DS1, p
129].
used to define the concept of Logos (ratio) in the theory of proportions developed
by Greek mathematicians preceding Eudoxus (400-349 BCE). For further details, 
see [F] and [Vn, p 90-91]. 


Euclidean Algorithm 


Books VII, VIII and IX of Euclid's Elements ([He]) describe the basic multiplicative
properties of positive integers. As Weil remarks ([We2, p 4]): 


It is generally agreed upon that much, if not all, of the content of those books is of earlier 
origin, but little can be said about the history behind them. 


Irrespective of the actual genesis of specific results, the Elements (c. 300 BCE) is 
the oldest extant book which makes a systematic exposition of the fundamentals of 
"Number Theory". 

The Euclidean algorithm for finding GCD is described at the very beginning of
Book VII of the Elements (Propositions 1-2); and presented again, in a geometrical
form, in Book X (Propositions 2-3).³⁴ We briefly recall the method; for clarity,
we take advantage of modern language, especially the subscript notation. For a 
more detailed mathematical exposition of the Euclidean algorithm and associated 
concepts like GCD at an elementary level, the reader may refer to [JJ, Ch 1]. 

Note that, for integers, the principle of Antanairesis or mutual "subtraction"
may be abbreviated to the following principle of "division": Given two positive
integers a, b with a > b, there exist unique non-negative integers q, r such that
a = bq + r with 0 ≤ r < b. This result is often called the "Division Algorithm",
although it is a theoretical statement rather than a practical algorithm: it does
not describe any method to find q and r when a and b are given. Although Euclid
too uses the language of repeated mutual subtraction in Propositions 1-2 of Book
VII ([He, Vol 2, p 296-299]), the above principle (of taking the remainder of
one number divided by another) is implicit in his proof. It is not known how
Greeks performed division during Euclid's time,³⁵ and how they implemented
the "Division Algorithm" for large numbers (by any method more efficient than
repeated subtraction). As mentioned earlier, during the Renaissance in Europe, the
modern method of "Long Division" evolved out of the corresponding ancient Indian
method.


34 In Book VII, the algorithm is formulated for integers; in Book X, it is formulated for lengths of line
segments. In the latter case, one defines the GCD, or rather its synonym the Greatest Common Measure
(GCM), of two lengths a and b to be the greatest length g such that a and b are both integer multiples of
g. The GCM will exist if and only if a and b are commensurable, that is, in our language, if and only if
a/b is a rational number.

35 Smith writes ([Sm, p 133]): "We are quite ignorant as to the way in which the Greeks and Romans
performed the operation of division before the Christian Era."



By the so-called Division Algorithm (or by Antanairesis), given two positive
integers a, b, one has a sequence of relations a = a_1 b + r_1, 0 < r_1 < b;
b = a_2 r_1 + r_2, 0 < r_2 < r_1; r_1 = a_3 r_2 + r_3, 0 < r_3 < r_2; and so on, till one
arrives at a stage where r_{n+1} = 0, that is, r_{n-1} = a_{n+1} r_n. The Euclidean algorithm
asserts that r_n is the GCD of a and b. It is based on the principle that the GCD of
two numbers does not change if the smaller number is subtracted from the larger
number. For instance, the GCD of 24 and 18 is the same as the GCD of 6 (= 24 - 18)
and 18. Consequently, GCD(a, b) = GCD(a - b, b) = ... = GCD(a - a_1 b, b) =
GCD(b, r_1) = GCD(r_1, r_2) = ... = GCD(r_{n-1}, r_n) = r_n.

Although Elements is the earliest available text which gives a clear description 
of the algorithm, scholars of Greek mathematics believe that it was known much 
earlier, at least in its subtractive form. The Euclidean algorithm was known in China 
and in India. As explained earlier, in the absence of enough historical documents, it 
is not known whether it was discovered independently and when the algorithm was 
first introduced or discovered in these countries. 

The Euclidean algorithm is one of the oldest algorithms that is still in common
use. A crucial feature of the algorithm is that it finds the GCD efficiently without
requiring the determination of the prime factors of the numbers.³⁶ The algorithm
is logarithmic in the size of integers. G. Lamé (1845) proved that the number of
division steps required by the Euclidean algorithm to compute the GCD of two
given integers never exceeds five times the number of digits (in decimal notation)
of the smaller integer ([Kn, p 343]).
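The division form of the algorithm, together with Lamé's bound, can be illustrated by a minimal Python sketch. The function name gcd_steps and the step counter are our own illustrative choices, not part of any historical text; the sketch assumes positive integers with a >= b.

```python
def gcd_steps(a, b):
    """Return (gcd, number of division steps) for positive integers a >= b."""
    steps = 0
    while b != 0:
        # Replace the pair (a, b) by (b, r), where a = q*b + r and 0 <= r < b.
        a, b = b, a % b
        steps += 1
    return a, steps


# The example from the text: GCD(24, 18) = GCD(6, 18) = 6.
g, steps = gcd_steps(24, 18)

# Lamé's bound: at most five times the number of decimal digits of the smaller integer.
_, fib_steps = gcd_steps(89, 55)
within_bound = fib_steps <= 5 * len(str(55))
```

Consecutive Fibonacci numbers such as 89 and 55 are used here because they are the slowest inputs for the algorithm, so they test the bound most severely.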


5. Kuttaka 
Brevity in Indian Tradition 


The original mathematical treatises in ancient India (especially the earlier texts)
were generally very brief. The verses were meant to indicate broad hints. There
was emphasis on the use of a learner's own intellect for working out the complete
details.³⁷ Detailed expositions on the works of the Masters were orally transmitted
through the guru-sisya (mentor-disciple) link. Sometimes the commentaries give
details. The approach of the ancient stalwarts can be seen from the following


36 Factorisation of large integers is regarded as a difficult problem; in fact, many systems of modern
cryptography are based on the non-availability of means to determine factorisation of sufficiently large 
integers. 

37 As an extreme example, consider the brevity of Aryabhata's instruction for constructing a sphere
which would rotate uniformly at the rate of Earth's rotation (Gola 22): "[Make] a wooden gola (globe)
which is perfectly spherical and uniformly dense all around [but is] light in weight. Using mercury, oil
and water, and applying one's own intellect, [make the globe] rotate [at the required rate] to keep pace
with time." See [Du2, Resonance 11(3), p 67].
remarks that Bhaskara II makes in the epilogue of his text Bijaganita ([Ab, p 53];
[P, p 196-197]):


The expanse of science is vast as the ocean, which a person of ordinary intellect finds too 
formidable to cross. But what is the necessity of details for the intelligent student? A mild 
instruction suffices; for it helps the intelligent student to develop the knowledge on his own. 
Just as a drop of oil put in water, or a grain of secret confided to a villain, or a little act of 
charity to the deserving, spreads automatically, likewise a quantum of knowledge, instilled 
into an intelligent mind, grows and expands extensively by its own force. 


This attitude is not much different from that of most modern mathematicians who 
too feel that a budding researcher eventually gains more insight from a terse text 
than from a clearly spelt-out text, from an obscure important paper than from a lucid 
one. (For further discussion, see [DS1, p 126]; [Du1, Remark 4, p 110-111].)

The original description of kuttaka by Aryabhata (see Appendix) is excruciat- 
ingly brief. Historians of mathematics have interpreted his two verses based on 
ancient commentaries; for instance, B. Datta has been guided by the explanation 
of Bhaskara I, the earliest commentator on Aryabhatiya whose work is available 
to us. The descriptions by Bhaskara I, Brahmagupta and several other writers are 
quite similar, in essence. Aryabhata II observed some simplifying devices which 
were emulated by later writers. In Bijaganita, Bhaskara II makes a neat presen- 
tation of the main algorithm and the useful auxiliary principles and states a few 
suggestive examples where one could implement the prescribed rules ([Ab, Ch 5];
[P, Ch 5]).

In this section, we shall make a summary of the various presentations of the 
kuttaka by authors ranging from Bhaskara I to Bhaskara II. Details take up 54 pages
in the book of Datta-Singh ([DS2, p 87-140]). We shall try to highlight the key 
ideas which would be particularly enriching for students of mathematics. 


Formulation of the Kuttaka 


As the linear indeterminate equation had applications in astronomy, most Indian
mathematicians considered the problem of determining all positive integral
solutions of equations of the type ay - bx = ±c, where a, b, c are positive integers.

As an illustration of how a concrete problem like 100y - 63x = ±90 would be
usually phrased in verses, we quote from the Bijaganita (also Lilavati) of Bhaskara II 
([Ab, p 20]; [P, p 58]; [PNS, p 167]): 


If thou be an expert in kuttaka analysis, please tell me that number(s) which, when multiplied 
by 100 and increased or decreased by 90, would become divisible by 63 without a remainder. 


Thus the equation ay - bx = ±c was visualised in the form y = (bx ± c)/a. The
quantities a, b, c, x, y were called hāra or bhājaka (divisor), bhājya (dividend),
ksepa (interpolator), gunaka (multiplier) and phala or labdhi (quotient) respectively.
Bhaskara I arranges the equation in the form y = (bx + c)/a or x = (ay + c)/b so as to ensure
that the interpolator c comes with a positive sign. Almost all ancient Indian writers
observe, in some form, that the equation ay - bx = c will have integral solutions
only if c is divisible by the GCD of a and b called apavartana or apavarta.


Reductions 


The treatment of the kuttaka reveals a tendency by ancient Indians to employ “reduc- 
tions”; an aspect which has pedagogic value for training budding mathematicians. 
We shall give a few examples. 

First, Bhaskara I, Brahmagupta, Aryabhata II, Sripati and Bhaskara II, among
others, explicitly state that all the coefficients should be divided by GCD(a, b)
(= GCD(a, b, c)), so that the coefficients in the reduced equation become relatively
prime, or to use ancient terminology, [mutually] drdha (firm or reduced), niccheda
(having no divisor), nirapavarta (irreducible). The first example 221y + 65 = 195x
in the kuttaka section of the Bijaganita of Bhaskara II ([Ab, p 20]; [P, p 57]; also
[PNS, p 171]) is meant to illustrate this reduction. The equation 221y - 195x = -65
reduces to 17y - 15x = -5. It may be pointed out to students that, by this reduction,
not only do the coefficients become smaller (thereby simplifying computations),
but, more importantly, one is now better equipped to tackle the problem as one has
the advantage of the additional property of a and b being coprime, a property
which is potentially useful.

Another subtle reduction of Aryabhata II, applicable in case there is a common
factor between a and c or between b and c, further reduces the size of the coefficients
of x and y. Let g_1 = GCD(a, c), a_1 = a/g_1, g_2 = GCD(b, c/g_1) and b_1 = b/g_2. Then
Aryabhata II, and his successors like Bhāskara II, observe that the problem of
solving ay - bx = ±c reduces to the problem of solving a_1 Y - b_1 X = ±1: if
(u, v) is an integral solution of the latter, then (cu/g_2, cv/g_1) is an integral solution of the
original equation. For instance, the problem of solving 100y - 63x = ±90 (cited
by Bhāskara II) reduces to solving 10Y - 7X = ±1. Once one obtains the solution
X = 3, Y = 2 of the reduced equation 10Y - 7X = -1 by the main algorithm
(in this case, one can also get it easily by inspection!), one immediately gets the
solution x = 30, y = 18 for the original equation 100y - 63x = -90.
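The arithmetic of this reduction can be checked with a short Python sketch. It assumes the reading g_1 = GCD(a, c), a_1 = a/g_1, g_2 = GCD(b, c/g_1), b_1 = b/g_2, with a solution (u, v) of a_1 Y - b_1 X = ±1 lifted to (cu/g_2, cv/g_1). The function names are hypothetical and introduced only for illustration; a and b are assumed coprime.

```python
from math import gcd


def reduce_coefficients(a, b, c):
    """Return (a1, b1, g1, g2) for the equation a*y - b*x = ±c with coprime a, b."""
    g1 = gcd(a, c)          # common factor of a and c
    a1 = a // g1
    g2 = gcd(b, c // g1)    # common factor of b and the remaining part of c
    b1 = b // g2
    return a1, b1, g1, g2


def lift_solution(u, v, c, g1, g2):
    """Lift (X, Y) = (u, v) solving a1*Y - b1*X = ±1 to (x, y) solving a*y - b*x = ±c."""
    return c * u // g2, c * v // g1


# Bhaskara II's example 100y - 63x = -90 reduces to 10Y - 7X = -1.
a1, b1, g1, g2 = reduce_coefficients(100, 63, 90)
x, y = lift_solution(3, 2, 90, g1, g2)
```

Here the lifted pair satisfies 100*18 - 63*30 = -90, in agreement with the solution quoted in the text.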

Second, the problem of finding all positive integral solutions is reduced to that
of finding one positive integral solution. Suppose that (u, v) is a positive integral
solution of ay - bx = ±c. From (u, v), one first finds the minimum positive integral


38 It is now a standard trick in modern mathematics to be on the lookout for simplifications or
“reductions” by which efforts are made to transfer a problem to an equivalent but neater and possibly 
more tractable problem where the underlying features become more transparent. 
solution (α, β).³⁹ Dividing u and v by a and b respectively, we have u = pa + r
and v = qb + s for some whole numbers p, q, r, s such that 0 ≤ r < a and
0 ≤ s < b. If p = q, then (r, s) is clearly a solution of ay - bx = ±c; in fact, it
is the minimum positive integral solution (proof is easy). If p ≠ q, then it can be
seen that p < q when we consider the equation ay - bx = c; and p > q when
we deal with ay - bx = -c. The minimum positive integral solutions in the two
cases are (r, s + (q - p)b) and (r + (p - q)a, s) respectively. The above rule for
arriving at the minimum solution has been explained lucidly by Aryabhata II but
is already implicit in Bhaskara I. If (α, β) is a minimum positive integral solution
of ay - bx = ±c, then Bhaskara I and his successors describe the general positive
integral solution as (α + ta, β + tb) where t is a positive integer.
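The rule for passing from any positive solution to the minimum one can be sketched in Python as follows. The function name is hypothetical, and the numerical examples (right-hand sides 61 and -61) are our own illustrations, not taken from the ancient texts. The sketch assumes coprime a, b and that the remainders r and s are non-zero.

```python
def minimal_positive_solution(a, b, u, v):
    """Given a positive solution (x, y) = (u, v) of a*y - b*x = c (c of either sign),
    return the minimum positive solution by the rule described above."""
    p, r = divmod(u, a)   # u = p*a + r with 0 <= r < a
    q, s = divmod(v, b)   # v = q*b + s with 0 <= s < b
    if p == q:
        return r, s
    if p < q:             # arises when the right-hand side is positive
        return r, s + (q - p) * b
    return r + (p - q) * a, s   # arises when the right-hand side is negative


# 10y - 7x = 61 has the solution (17, 18); the minimum positive solution is (7, 11).
least_pos = minimal_positive_solution(10, 7, 17, 18)
# 10y - 7x = -61 has the solution (23, 10); the minimum positive solution is (13, 3).
least_neg = minimal_positive_solution(10, 7, 23, 10)
```

Adding t*a to the first component and t*b to the second then generates the remaining positive solutions.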

Third, it is clearly enough to solve an equation of the type ay - bx = ±1;
for, if av - bu = ±1, then a(cv) - b(cu) = ±c. Such equations were called
sthira-kuttaka (constant pulverizer). This simplification too was made by some of
the Indian mathematicians right from Bhaskara I. As we shall see towards the end of
this section, in problems of astronomy involving the equations ay - bx = c, the
conditions were often such that the coefficients a, b would be the same in several
equations but the interpolator c would vary. In such situations, working first with
the constant pulverizer and then modifying the solution according to the specific
problem would have been convenient.

Some of the Indian authors also showed that we can reduce to the case a > b. This
step was achieved through two different methods. The later method corresponds to
the modern approach: if b > a, think of the equation as bx - ay = ±c rather than
ay - bx = ±c. But earlier writers like Bhaskara I, who already had to arrange the
equation ay - bx = c so as to have only positive coefficients, used the following
device when b > a: Let b = aq + b_1 where b_1 < a. Then the original equation
transforms into the equation a y_1 - b_1 x = c (where y_1 = y - qx) which is of
the desired form. If (u, v) is a solution of a y_1 - b_1 x = c, then (u, v + qu) is a
solution of ay - bx = c. We shall henceforth assume a > b as it would facilitate
our discussions.


Main Steps 


We now describe the crux of Aryabhata's algorithm using the interpretation of
Bhaskara I. By successive division, we have a = a_1 b + r_1, b = a_2 r_1 + r_2,
r_1 = a_3 r_2 + r_3, ... and so on. Let n denote the number of steps after which the process
terminates. Since the GCD of a and b is 1, the final relation is r_{n-2} = a_n r_{n-1} + 1
(1 < r_{n-1} < r_{n-2} < ... < r_1 < b). Thus, r_n = 1 and r_{n+1} = 0. Given a and b, the


39 Since a and b are positive, if (x_1, y_1), (x_2, y_2) are pairs of positive integers satisfying ay - bx = ±c,
then x_1 < x_2 if and only if y_1 < y_2. Therefore, one can talk about a "minimum positive integral solution"
(α, β).
quantities a_1, ..., a_n can be quickly determined by the method of "long division"
for computing the GCD of a and b.

Now, for solving ay - bx = 1, define quantities x_{n+2}, x_{n+1}, x_n, ... by backward
induction as follows: define x_{n+2} and x_{n+1} to be whole numbers satisfying the
relation r_{n-1} x_{n+2} - x_{n+1} = (-1)^n. Thus, if n is odd, one can simply take x_{n+2} = 0
and x_{n+1} = 1; if n is even, one can take x_{n+2} = 1 and x_{n+1} = r_{n-1} - 1. Now
define x_m (n ≥ m ≥ 1) by x_m = a_m x_{m+1} + x_{m+2}. For facilitating the computation
of x_n, ..., x_2, x_1, the Indians constructed convenient tables called valli. As will be
clear from subsequent discussions, the numbers x_1, x_2 satisfy a x_2 - b x_1 = 1; thus
(x_1, x_2) is a solution of the equation ay - bx = 1.

As a simple example, consider 10y - 7x = 1 (the reduced form of one of the
exercises set by Bhaskara II). Here

a = 10, b = 7; so that a_1 = 1, r_1 = 3; a_2 = 2, r_2 = 1.

The reverse algorithm yields

x_4 = 1; x_3 = 3 - 1 = 2; x_2 (= a_2 x_3 + x_4) = 5; x_1 (= a_1 x_2 + x_3) = 7.

Thus we obtain the solution x = 7, y = 5.
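The forward division and the backward (valli-style) computation can be sketched in Python as below. This is only one modern rendering of the procedure, under the assumptions that a > b >= 1 are coprime, that r_0 = b and r_{-1} = a by convention, and that the seeds are (x_{n+2}, x_{n+1}) = (0, 1) for odd n and (1, r_{n-1} - 1) for even n. The function name kuttaka is ours.

```python
def kuttaka(a, b):
    """Return (x, y) with a*y - b*x = 1, for coprime integers a > b >= 1."""
    # Forward pass: quotients a_1..a_n, and remainders stored as
    # [r_{-1}, r_0, r_1, ..., r_n] = [a, b, ...], stopping when r_n = 1.
    quotients = []
    remainders = [a, b]
    while remainders[-1] != 1:
        q, r = divmod(remainders[-2], remainders[-1])
        quotients.append(q)
        remainders.append(r)
    n = len(quotients)
    r_before_last = remainders[-2]          # this is r_{n-1}

    # Seeds satisfying r_{n-1}*x_{n+2} - x_{n+1} = (-1)**n.
    if n % 2 == 1:
        x_far, x_near = 0, 1                # (x_{n+2}, x_{n+1})
    else:
        x_far, x_near = 1, r_before_last - 1

    # Backward pass: x_m = a_m * x_{m+1} + x_{m+2} for m = n, ..., 1.
    for a_m in reversed(quotients):
        x_near, x_far = a_m * x_near + x_far, x_near

    return x_near, x_far                    # (x_1, x_2) = (x, y)


x, y = kuttaka(10, 7)   # the worked example above: x = 7, y = 5
```

The list of quotients plays the role of the column of numbers in the valli; only two running values need to be kept during the backward pass, which fits the economy of figures described earlier for board computation.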

Bhaskara I explains that one need not continue the repeated division till the stage
r_n = 1. One can terminate at an intermediate stage k if, having obtained r_1, ..., r_k,
one can observe, by inspection, positive integers u, v satisfying r_{k-1} u - r_k v = (-1)^k.
One can then define x_{k+2} (= u), x_{k+1} (= v), ..., x_2 (= y), x_1 (= x) recursively, as
before, and obtain a solution of the equation ay - bx = 1.

For solving ay - bx = -1, one considers numbers x_{k+1} (k ≤ n) and x_{k+2}
satisfying r_{k-1} x_{k+2} - r_k x_{k+1} = (-1)^{k+1}; the rest is as above. Or, one could take
the approach of Bhaskara II: solve ay - bx = 1 and use the fact that if (α, β) is
the minimum positive integral solution of ay - bx = 1, then (a - α, b - β) is the
minimum positive integral solution of ay - bx = -1.

For instance, consider 10y - 7x = -1. While a_1, r_1, a_2, r_2 (= 1) are as before,
now the reverse algorithm yields

x_4 = 0, x_3 = 1 so that x_2 (= a_2 x_3 + x_4) = 2; x_1 (= a_1 x_2 + x_3) = 3;

and we obtain the solution x = 3, y = 2. Alternatively, having computed the
solution (7, 5) of the equation 10y - 7x = 1, one can immediately deduce the
solution x = 10 - 7 = 3 and y = 7 - 5 = 2 for the equation 10y - 7x = -1.
Brahmagupta, Bhaskara II and Narayana give similar rules for deriving integer
solutions of ay + bx = c.

In section 4, we mentioned the efficiency of the Euclidean algorithm
for determination of the GCD. As can be seen from the above steps, the extension, 
by kuttaka, to the solution of the linear indeterminate equation ay — bx = c, is 
again an efficient algorithm. 
The Descent-Like Idea


As indicated by Weil in [We2, p 93], there is a descent-like idea in the kuttaka
method for solving ay - bx = ±1. We may formulate it as follows (to fix our ideas,
let us take the RHS to be 1):


(1) Assume the existence of a positive integral solution (x, y) to the equation
aY - bX = 1.

(2) Then transform this equation in successive steps into equations with smaller
and smaller coefficients and solutions to eventually arrive at an equation
a'Y' - X' = ±1 with an obvious solution (x', y') = (a't ∓ 1, t) for any t.

(3) Then work backwards from this obvious solution (x', y') of the reduced
equation to determine the desired solution (x, y) of the original equation.


To elaborate, assume that (x, y) is a positive solution of the equation aY - bX = 1.
Recall that we are assuming 0 < b < a and a, b are coprime. If a - b = 1, then
(1, 1) is a solution of aY - bX = 1. So we may further assume that a - b > 1. Then
x > y. If b had been 1, we would have been through. This prompts the approach
(instructive for students): why not try to "break" (i.e., "pulverize") the coefficients
a, b into smaller ones? Recall the relation a = a_1 b + r_1 with 0 < r_1 < b. Each
component of the pair (b, r_1) is smaller than that of the pair (a, b). Now one tries to
transform the equation aY - bX = 1 to get an equation with coefficients b and r_1.
The relation ay - bx = 1 leads to the relation (a_1 y - x)b + r_1 y = 1.

Set z := x - a_1 y. Then clearly 0 < z < y and (y, z) is a positive solution of
the equation bZ - r_1 Y = -1 which is again a linear equation of the original form;
but now the solution (y, z) is co-ordinate-wise smaller than the original (x, y) and,
moreover, the coefficients (of the new equation) too have become smaller since
0 < b < a and 0 < r_1 < b. The two pairs of solutions being linearly related, if one
can determine the smaller pair (y, z), one can easily compute the original (x, y).

Denote x, y, z by x_1, x_2, x_3 respectively. Proceeding as above (introducing
x_4, x_5, ..., etc.), since r_n = 1 (as per our earlier notation), one eventually
arrives at the easily solvable equation r_{n-1} x_{n+2} - x_{n+1} = (-1)^n. From this
equation, one works backwards to arrive at the solution (x, y) of the original equation
aY - bX = 1.

For instance, in the example 10x_2 - 7x_1 = 1, the kuttaka uses the fact that solving
10x_2 - 7x_1 = 1 is equivalent to solving 7x_3 - 3x_2 = -1 (where x_3 = x_1 - x_2),
which, in turn, is equivalent to solving 3x_4 - x_3 = 1 (where x_4 = x_2 - 2x_3); and
the latter has the obvious positive integral solution x_3 = 2, x_4 = 1.
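The chain of successively smaller equations can be generated mechanically. The following Python sketch (with a hypothetical function name) lists, for coprime a > b >= 1, the triples (A, B, s) standing for the equations A*Y' - B*X' = s met during the descent, starting from aY - bX = 1 and stopping when the second coefficient becomes 1.

```python
def descent_chain(a, b):
    """Return the list of (A, B, s), each meaning the equation A*Y' - B*X' = s."""
    rhs = 1
    chain = [(a, b, rhs)]
    while b != 1:
        # Pass from coefficients (a, b) to the smaller pair (b, r), r = a mod b;
        # the right-hand side changes sign at every step.
        a, b = b, a % b
        rhs = -rhs
        chain.append((a, b, rhs))
    return chain


chain = descent_chain(10, 7)
# chain == [(10, 7, 1), (7, 3, -1), (3, 1, 1)], i.e. the three equations
# 10x_2 - 7x_1 = 1, 7x_3 - 3x_2 = -1 and 3x_4 - x_3 = 1 of the example above.
```

Both coefficients shrink at each step, which is the feature that invites the comparison with descent.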

Thus, the main algorithm itself is an illustration of a non-trivial application of the
modern "reduction" principle: it transfers, through a sequence of steps, a somewhat
involved problem to a problem with an obvious solution. Since the process involves
the breaking up of the original data (both solutions and coefficients) into successively
smaller numbers by repeated division, the term kuttaka (pulverizer) is apt; in fact,
it gives a hint to the learner of the main idea behind the algorithm. Thus the kuttaka
(both the method as well as the term itself) resembles Fermat's celebrated principle
of descent. As Weil remarks ([We2, p 7]):


In later Sanskrit texts this became known as the kuttaka (= "pulverizer") method; a fitting 
name, recalling to our mind Fermat's "infinite descent". 


Euclidean Algorithm and Kuttaka 


There is a tendency among writers on history of mathematics to make remarks to the 
effect that the solution to the linear Diophantine equation "is essentially identical 
with" or “does not differ substantially from" or “is just a straightforward application 
of" the “Euclidean algorithm" for finding the GCD, and thereby to undermine (often 
inadvertently) the achievement involved in the discovery of the kuttaka. As we have 
seen, there are two major steps or principles involved in the ancient Indian kuttaka, 
as also in Bachet's solution of the linear indeterminate equation ay — bx = c. 


I. Computation of the GCD d of a and b.
II. Expression of d as a linear combination of a and b. 


In modern college-algebra, the two steps may be viewed, respectively, as
corresponding to the algorithmic proofs of the following generalised abstract statements:
(I) "Z is a Euclidean Domain and in a Euclidean Domain, any two non-zero elements
have a GCD." and (II) "Any Euclidean Domain is a PID."

Step II does not occur in the Elements or in any extant ancient Greek text. Its 
explicit presence can be seen in ancient Indian texts; in Europe, it occurs for the 
first time in Bachet's work (1624). Modern mathematicians, well-trained in years 


40 A Euclidean Domain is an integral domain R in which the "Division Algorithm" holds for any two
elements a and b of R with b ≠ 0; that is, there exists a (non-negative) integer-valued function N on R
with N(0) = 0 such that given a, b (≠ 0) ∈ R, there exist q, r ∈ R such that a = bq + r with either
r = 0 or N(r) < N(b). A PID (Principal Ideal Domain) is an integral domain in which every ideal is
principal, that is, every ideal is of the form Rd for some d ∈ R.

The concept of GCD of two integers has the following generalisation in the case of an integral domain
R: d is said to be a GCD of two non-zero elements a and b of R if (i) a = a_1 d and b = b_1 d for some
a_1, b_1 ∈ R, and (ii) if d' ∈ R is such that a = a'd' and b = b'd' for some a', b' ∈ R, then d = d_1 d'
for some d_1 ∈ R. As in Euclid's Elements (Proposition 2 of Book VII), the repeated application of
Division Algorithm shows that any two non-zero elements of a Euclidean Domain have a GCD and that 
it can be computed algorithmically. By describing, again algorithmically, how to express the GCD as 
a linear combination of a and b, Aryabhata's kuttaka shows that the ideal generated by two non-zero 
elements a and b of a Euclidean Domain R is the principal ideal generated by their GCD thereby giving 
a constructive proof of the property that any Euclidean Domain is a PID (the converse is not true). For 
further details on Euclidean Domains and PIDs, see chapter 8 of Abstract Algebra by D.S. Dummit and 
R.M. Foote (Wiley 2003). 
of high-school classical algebra, sometimes think of Step II as a trivial corollary to
Step I. But to appreciate Step II in the context of its history in India, one should
note that it was discovered before algebra had formally established itself in Indian
mathematics and that at the time of discovery and for a few centuries thereafter, it was
considered a significant feat by some of the strong ancient Indian mathematicians.
Even Brahmagupta's treatise places considerable emphasis on the kuttaka and
provides plenty of illustrative examples. All of them touch Step I lightly; they dwell
mainly on Step II. One has also to note that, after Bachet's rediscovery (1612) of the
kuttaka, Legendre had described the method as very ingenious as late as in 1798.⁴¹

A profound aspect of the kuttaka method for solving Step II is that it contains 
the seeds of the idea of "descent". In fact, Bachet's rediscovery of the method might 
have, in a subtle way, influenced the discovery of the "descent" principle. While 
analyzing Brouncker's solution to the equation Dx^2 + 1 = y^2, Weil brings out
the implicit "descent"-like idea in Brouncker's method and remarks that Bachet's
method must have provided a model for Fermat and Brouncker. We quote below 
excerpts from the passage of Weil ([We2, p 93]); we have highlighted a portion in 
italics. 


The starting point here is to assume the existence of a solution (x, y) of x^2 - Ny^2 = 1, and
to transform the problem by successive steps into others, each of which has a solution in
smaller numbers; . . . this is typical of Fermat's descent; the difference lies in the fact that in
the latter case one seeks to prove that there is no solution, while in the former the purpose is
to find one. At the same time, Bachet's method for the solution of the equation ay - bx = ±1
undoubtedly provided a model for Wallis, Brouncker and Fermat to follow, . . .


Weil then revisits Bachet's solution of the linear indeterminate equation, bringing 
out its descent-like aspect. It may be noted that the discovery of the Indian cakravála 
(which also has a descent-like aspect to it) had taken place in an environment where 
the kuttaka method had a strong influence; in fact, the kuttaka algorithm itself was 
used in the solution (as we shall see in section 7). 

The above considerations suggest that the kuttaka is perhaps more subtle than 
what its description as being "essentially the Euclidean algorithm" would indicate. 
Step II is of interest in computational number theory and is a fundamental step 
in cryptography. The equation d = ay - bx, expressing the GCD d as a linear


41 Again, one has to consider the historical context: around Bachet's time symbolic algebra had just
begun to establish itself in Europe. Even F. Viète (1540-1603), who had a pioneering role in introducing
symbolic algebra in Europe, did not consider negative numbers. In fact, till the 16th century, Arab and
European mathematicians had struggled even with problems involving equations of the type ax + b = c
(where a, b, c are positive numbers) as can be seen from the prevalence of the cumbrous "rule of false
position". As D.E. Smith remarks ([Sm, p. 437]): "To the student of today, having a good symbolism
at his disposal, it seems impossible that the world should ever have been troubled by an equation like
ax + b = 0. Such, however, was the case, ..."
combination of integers a and b, is sometimes known as Bézout's identity ([JJ, p 7]);
perhaps Aryabhata's identity would be more appropriate historically.


Continued Fractions 


The kuttaka may be interpreted as a technique in the theory of continued fractions.
The formulation y = (bx ± c)/a and method of solution have given rise to a suggestion that
the pioneers of the kuttaka algorithm knew the basic principles of continued fractions
([Bg]). Knowledge of continued fractions is more apparent in some of the later Indian
works. In the versions described above, after obtaining the quotients a_1, ..., a_n,
one computes quantities x_n, x_{n-1}, ... in the backward direction (i.e., one moves
from below to top in the table). But in certain later texts like the Karanapaddhati and
the Yuktibhasa (1540 CE) of Jyesthadeva, having obtained the quotients a_1, ..., a_n,
one constructs sequences of numbers p_m and q_m by the recurrence relations


p_0 = 1, p_1 = a_1; p_m = a_m p_{m-1} + p_{m-2}


q_0 = 0, q_1 = 1; q_m = a_m q_{m-1} + q_{m-2}


till one reaches p_n and q_n (where n is as before). Then a q_n - b p_n = (-1)^{n+1}, i.e.,
(p_n, q_n) is a solution of one of the equations ay - bx = ±1 (the solutions of the other
equation can be derived using techniques discussed earlier). Note that the quantities
a_1, a_2, ..., a_n, a_{n+1} (= r_{n-1}) are the "quotients" in the continued fraction expansion
of a/b; and p_m/q_m are the successive "convergents" (which are actually characterised by
the above recurrence relations). The solution of the linear Diophantine equation in
the notation of continued fractions was given by Saunderson in England in 1740
and Lagrange in France in 1767.
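The forward recurrences can be tried out with the following Python sketch. It assumes coprime a > b > 1, takes the quotients a_1, ..., a_{n+1} from the full Euclidean division of a by b, and uses p_0 = 1, p_1 = a_1, q_0 = 0, q_1 = 1 as above. The function name is illustrative.

```python
def penultimate_convergent(a, b):
    """Return (p_n, q_n, n) with a*q_n - b*p_n = (-1)**(n + 1), for coprime a > b > 1."""
    # Quotients a_1, ..., a_{n+1} of the continued fraction expansion of a/b.
    quotients = []
    hi, lo = a, b
    while lo != 0:
        q, r = divmod(hi, lo)
        quotients.append(q)
        hi, lo = lo, r
    n = len(quotients) - 1

    p_prev, p = 1, quotients[0]   # p_0, p_1
    q_prev, q = 0, 1              # q_0, q_1
    for a_m in quotients[1:n]:    # m = 2, ..., n
        p_prev, p = p, a_m * p + p_prev
        q_prev, q = q, a_m * q + q_prev
    return p, q, n


p, q, n = penultimate_convergent(10, 7)
# Here 10/7 has quotients 1, 2, 3, so n = 2 and (p_2, q_2) = (3, 2):
# 10*2 - 7*3 = -1 = (-1)**3, the solution (3, 2) of 10y - 7x = -1 found earlier.
```

Unlike the backward valli computation, this version needs no stored table: each quotient can be consumed as soon as it is produced.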


Interpretation of Sathaye 


We have discussed the traditional Indian approaches to kuttaka due to Bhaskara I 
and later mathematicians. Avinash Sathaye has given an interesting and efficient 
version of Aryabhata’s algorithm that is lucidly explained, with an example, on his 
website ([Sat, chapter 3]; especially section 3.2). Sathaye's version can be presented 
as follows. 

We are to find x and y such that ay - bx = d, where d is the GCD of a and b. Let
a_1, a_2, ... be the successive quotients and r_1, r_2, ... be the successive remainders
in the standard division algorithm for computing the GCD of a and b. Thus, if
a > b, we have r_1 = a - a_1 b, r_2 = b - a_2 r_1 = -a_2 a + (1 + a_1 a_2)b, and so on.
Now set


u_0 := 0, v_0 := 1, u_1 := 1, v_1 := -a_1.
Thus a u_1 + b v_1 = r_1. Define u_m, v_m inductively by:
u_m = u_{m-2} - a_m u_{m-1}; v_m = v_{m-2} - a_m v_{m-1}.


Then one can see that r_m = a u_m + b v_m for each m; in particular, if d is reached at
stage n (i.e., if r_n = d), then (-v_n, u_n) is a solution of ay - bx = d. In fact, at the next
stage, we get a u_{n+1} + b v_{n+1} = 0 (since r_{n+1} = 0) so that (-v_n - t v_{n+1}, u_n + t u_{n+1})
is a solution for any integer t.

While the above algorithm fits the terse verses of Aryabhata, it requires the use 
of negative numbers (otherwise, one has to make a careful adjustment of sign as 
before). One is not sure if the use of negative numbers had become sufficiently 
well-established (even among any elite section) during his time. But irrespective of 
whether Aryabhata actually meant it, the above version would be a useful supple- 
ment for pedagogic purposes. 
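A Python sketch of this signed version follows. It is our own rendering, not code from [Sat]: the starting values u_{-1} = 1, v_{-1} = 0 (so that r_{-1} = a) are a convenience we assume in order to run the recurrence uniformly from m = 1, and they reproduce u_1 = 1, v_1 = -a_1.

```python
def signed_kuttaka(a, b):
    """Return (d, x, y) with d = GCD(a, b) and a*y - b*x = d, for positive integers a, b."""
    r_prev, r = a, b          # r_{-1}, r_0
    u_prev, u = 1, 0          # u_{-1}, u_0
    v_prev, v = 0, 1          # v_{-1}, v_0
    while True:
        q, r_next = divmod(r_prev, r)
        if r_next == 0:       # the current r is the GCD d = r_n
            break
        # Maintain the invariant r_m = a*u_m + b*v_m at every stage.
        r_prev, r = r, r_next
        u_prev, u = u, u_prev - q * u
        v_prev, v = v, v_prev - q * v
    return r, -v, u           # (d, x, y) = (r_n, -v_n, u_n)


d, x, y = signed_kuttaka(10, 7)
# For 10y - 7x = 1 this yields x = -3, y = -2; negative values appear naturally,
# which is the point made above about the need for signed numbers.
```

Adding a suitable multiple of (a/d, b/d) to (x, y) converts such a pair into a positive solution.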


Samslista Kuttaka 


Ancient Indian algebraists also solved simultaneous linear indeterminate equations 
of the type 


b_1 y - a_1 x_1 = c_1, b_2 y - a_2 x_2 = c_2, ..., b_r y - a_r x_r = c_r


under the obvious consistency conditions. Note that the special case b_1 = b_2 =
... = b_r = 1 is the "Chinese Remainder" problem of finding an integer y which when
divided by a_1 leaves a remainder c_1, when divided by a_2 leaves a remainder c_2, and
so on. In fact, Aryabhata's verses actually mention the problem of simultaneous
remainders (for r = 2). It is not clear from his brief verses whether Aryabhata
considered only the case b_1 = b_2 = 1 or the more general case; the commentary on
Aryabhata's verses by Parameśvara (c. 1430 CE) explains the general case itself.
Bhaskara I and Brahmagupta had discussed examples where b_1 = ... = b_r = 1.
This system was called samslista kuttaka (conjunct pulverizer). The generalised 
conjunct pulverizer was considered by later mathematicians like Mahavira (850 
CE) and Sripati (1039 CE). 

Bhaskara I used the term niragra-kuttakara (non-residual pulverizer) for the 
problem of solving simultaneous linear indeterminate equations; and the term sagra- 
kuttakara (residual pulverizer) for the equivalent problem of solving simultaneous 
remainders. These problems have applications in computations involving periodic 
events — for instance, in calendar-making and in the determination of the time 
when a certain alignment of celestial bodies would occur (e.g., the determination 
of the time of eclipses). To see the connection, note that if $k$ events $E_1, \ldots, E_k$ occur
regularly with periods $a_1, \ldots, a_k$, with $E_i$ happening at times $c_i, c_i + a_i, \ldots$, then the
$k$ events occur simultaneously at time $N$ where $N$ is a number which, when divided
by each $a_i$ ($1 \le i \le k$), leaves remainder $c_i$. An astronomy problem based on a
linear indeterminate equation was called graha-kuttakara (planetary pulverizer).
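
The connection between periodic events and simultaneous remainders can be illustrated with a short Python sketch. It is a modern formulation using the extended Euclidean algorithm, not a transcription of any verse; the function names and the sample periods are assumptions chosen for illustration.

```python
def extended_gcd(a, b):
    """Return (g, s, t) with g = gcd(a, b) and a*s + b*t == g."""
    old_r, r = a, b
    old_s, s = 1, 0
    old_t, t = 0, 1
    while r != 0:
        quotient = old_r // r
        old_r, r = r, old_r - quotient * r
        old_s, s = s, old_s - quotient * s
        old_t, t = t, old_t - quotient * t
    return old_r, old_s, old_t


def simultaneous_remainders(periods, remainders):
    """Smallest N >= 0 with N % a_i == c_i for every i, or None if inconsistent."""
    n, modulus = 0, 1
    for a, c in zip(periods, remainders):
        g, s, _ = extended_gcd(modulus, a)
        if (c - n) % g != 0:
            return None  # the consistency condition fails
        # Solve n + modulus*k ≡ c (mod a) for k, modulo a // g.
        k = ((c - n) // g * s) % (a // g)
        n += modulus * k
        modulus = modulus // g * a  # least common multiple so far
        n %= modulus
    return n


# Hypothetical events with periods 3, 5, 7 first occurring at times 2, 3, 2.
first_coincidence = simultaneous_remainders([3, 5, 7], [2, 3, 2])
```

The moduli are merged one at a time, so the same routine also handles periods that share a common factor, where a solution exists only under the consistency condition tested inside the loop.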


An Application in Astronomy 


Ancient Indian astronomers conceived of long time-periods (yuga, kalpa, etc) in 
which the five visible planets along with the Sun and the Moon were estimated 
to execute integral numbers of revolutions starting from a time when the above 
celestial objects were “in conjunction”, that is, all were on the great circle of
the “celestial sphere” passing through the “vernal equinox” and the pole of the
“ecliptic”.⁴² For instance, by the estimates in Aryabhatiya ([ShuS, p 6]), a yuga
of 4320000 “sidereal years” has 1577917500 civil days and, during this period,
Saturn and Mars perform, respectively, 146564 and 2296824 revolutions. Ancient 
astronomers must have used the kuttaka for estimating the moments when the visible 
planets (and the Sun and the Moon) would be in conjunction. It is perhaps significant 
that Aryabhata, who begins his treatise by mentioning the time of commencement 
of the current yuga and the number of revolutions performed by each planet in a 
yuga, ends his mathematics chapter by describing the kuttaka. 

We end this section by giving one example of a typical application of the kuttaka 
in Indian astronomy texts. Let D denote the number of civil days in a certain epoch 
and N the number of revolutions performed by a planet in that epoch. Let y denote 
the ahargana, the number of mean civil days elapsed since the beginning of the 
specified epoch up to a given day; and let x denote the bhagana, the number of
complete revolutions performed by the planet during this period. Note that x is the
quotient obtained by dividing the product yN by D. The kuttaka was applied to
determine the ahargana y and the bhagana x from the residue r = yN - xD (D
and N are known constants). For instance, in Laghu-Bhaskariya (chapter 8, verse
17), Bhaskara I gives certain algebraic conditions satisfied by the residues of Saturn
and Mars (on a given day) and asks the reader to calculate the ahargana (for that
day) and the bhagana of Saturn and Mars ([Shu3, p 99]). Since Saturn performs
146564 complete revolutions in 1577917500 days, on dividing the two integers by
the common factor 4, we get an intermediary epoch of 394479375 days in which
Saturn performs 36641 revolutions. Taking D = 394479375 and N = 36641,
one gets the sthira-kuttaka 36641Y - 394479375X = 1, from which the solutions
for y and x can be obtained depending on r: if (Y, X) solves the sthira-kuttaka, then
(rY, rX) solves yN - xD = r, and reducing rY modulo D gives the ahargana within
the epoch. In the case of Mars, dividing 2296824 and 1577917500 by their common
factor 12, one gets the sthira-kuttaka 191402Y - 131493125X = 1.
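
The following Python sketch shows how a solution of the sthira-kuttaka can be scaled to recover the ahargana from a residue. It uses Python's built-in modular inverse as a stand-in for the kuttaka procedure itself; the function names and the sample day count are assumptions, not values from the source.

```python
def sthira_kuttaka(n, d):
    """Least positive Y (with its X) such that n*Y - d*X == 1; needs gcd(n, d) == 1."""
    y = pow(n, -1, d)      # modular inverse of n modulo d (Python 3.8+)
    x = (n * y - 1) // d   # exact, because n*y leaves remainder 1 modulo d
    return y, x


def ahargana_from_residue(r, n, d):
    """Solve n*y - d*x == r with 0 <= y < d by scaling the sthira-kuttaka solution."""
    y_unit, _ = sthira_kuttaka(n, d)
    y = (r * y_unit) % d
    x = (n * y - r) // d
    return y, x


# Saturn, using the reduced constants quoted in the text.
N_SATURN, D_EPOCH = 36641, 394479375
Y, X = sthira_kuttaka(N_SATURN, D_EPOCH)

# Hypothetical check: pick a day count, form its residue, and recover it.
days = 123456
residue = (days * N_SATURN) % D_EPOCH
recovered_days, revolutions = ahargana_from_residue(residue, N_SATURN, D_EPOCH)
```

Because N and D are coprime after the reduction, the residue determines the ahargana uniquely within one epoch of D days.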


⁴² An explanation of the above terms is given in [Du2, Part 1: The Khagola (The Celestial Sphere)
and Part 2: The Naksatra Dina (Sidereal Day)] and the significance of yuga in Indian astronomy has been
discussed in [Du2, Part 3: A Brief History].


6. Bhavana


The Main Principle


Brahmagupta's presentation of results on quadratic indeterminate equations resem- 
bles a typical modern arrangement where one first develops the fundamental theory 
or principle that emerges out of the investigations into a problem, and then records 
the solution to the original problem(s) as one application of the basic theory. Thus, 
after announcing Atha Varga Prakrtih⁴³, Brahmagupta immediately states the following
result ([Sha, Ch 18; verses 64-65]) which is the cornerstone of the entire
section.


Theorem 1 (Brahmagupta’s Bhavana).
If $y_i^2 = D x_i^2 + m_i$, $i = 1, 2$, then $y = D x_1 x_2 \pm y_1 y_2$, $x = x_1 y_2 \pm x_2 y_1$, $m = m_1 m_2$
satisfy $y^2 = D x^2 + m$.⁴⁴


The Appendix quotes the portion of Brahmagupta's verses pertaining to the
above result. Theorem 1 is also stated in verse form by Acarya Jayadeva, Bhaskara
II (1150), Narayana (1350), Jñānarāja (1503) and Kamalakara (1658); a proof is
described in the Bijapallava (1548) of Krsna ([DS2, p 148-149]).

Let $D$ denote a fixed positive integer. For convenience, we shall use the notation
$(p, q; m)$ to denote a triple of numbers satisfying $Dp^2 + m = q^2$. Then, in modern
language, Theorem 1 states that the solution space of the equation $Dx^2 + m = y^2$
admits the binary operations


\[ (x_1, y_1; m_1) \odot (x_2, y_2; m_2) = (x_1 y_2 + x_2 y_1,\; D x_1 x_2 + y_1 y_2;\; m_1 m_2), \]


\[ (x_1, y_1; m_1) \ast (x_2, y_2; m_2) = (x_1 y_2 - x_2 y_1,\; D x_1 x_2 - y_1 y_2;\; m_1 m_2). \]


Ancient Indian algebraists realised the importance of the composition laws (denoted
above by ⊙ and ∗) and used a special technical term bhāvanā (composition).
Jayadeva mentions the terms samāsa-bhāvanā (additive composition) and viśeṣa-bhāvanā
(subtractive composition) for ⊙ and ∗ respectively ([Shu1, p 5]).
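
As a minimal illustration, the two composition laws can be written as Python functions acting on triples (x, y, m) with D·x² + m = y². The function names are assumptions introduced here; only the formulas come from Theorem 1.

```python
def is_triple(t, D):
    """True if (x, y, m) satisfies D*x^2 + m == y^2."""
    x, y, m = t
    return D * x * x + m == y * y


def samasa_bhavana(t1, t2, D):
    """Additive composition of two triples."""
    x1, y1, m1 = t1
    x2, y2, m2 = t2
    return (x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2, m1 * m2)


def visesa_bhavana(t1, t2, D):
    """Subtractive composition of two triples."""
    x1, y1, m1 = t1
    x2, y2, m2 = t2
    return (x1 * y2 - x2 * y1, D * x1 * x2 - y1 * y2, m1 * m2)


# Composing the triple (1, 10; 8) for D = 92 with itself.
D = 92
additive = samasa_bhavana((1, 10, 8), (1, 10, 8), D)
subtractive = visesa_bhavana((1, 10, 8), (1, 10, 8), D)
```

Both results again satisfy the defining relation, with the interpolators multiplied, which is the closure property the theorem asserts.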


⁴³ atha: “[to commence] now”. The word has the nuance of an auspicious inception which is difficult
to convey in English.

⁴⁴ While the verses clearly refer to $y = D x_1 x_2 + y_1 y_2$ and $x = x_1 y_2 + x_2 y_1$, they also appear to
include $y = D x_1 x_2 \sim y_1 y_2$ and $x = x_1 y_2 \sim x_2 y_1$ (where $\sim$ denotes the difference). As discussed in ([Du1, p 108-109]), the expressions
saha and aikyam indicate “combining” $D x_1 x_2$ with $y_1 y_2$ and $x_1 y_2$ with $x_2 y_1$. The sense of “combining”
certainly includes the implication “adding”; thus the reference to the more important additive principle
(producing new roots $D x_1 x_2 + y_1 y_2$ and $x_1 y_2 + x_2 y_1$) is obvious. Presumably, “combining” also includes
“taking difference” (that is, alludes to the roots $D x_1 x_2 \sim y_1 y_2$ and $x_1 y_2 \sim x_2 y_1$). In any case, subsequent
expositors like Jayadeva and Bhāskara II clearly describe the two principles successively in separate
verses.

In the commentary on Brahma Sphuta Siddhanta by Prthudakasvamin (c. 860
CE), it is repeatedly emphasised, during the exposition of Brahmagupta’s verses
stating Theorem 1, that the new roots $(x, y, m)$ have been obtained from the given
roots $(x_1, y_1, m_1)$, $(x_2, y_2, m_2)$ by composition ([Col, p 363-364, footnotes]). Just
before stating the bhāvanā formulae, Jayadeva mentions ([Shu1, p 5]) that the bhāvanā
operations pervade endless algorithms.⁴⁵ The importance attached to the bhāvanā
also comes out in the version of Theorem 1 due to Bhāskara II (see [Du1, p 111-112])
from which we quote an excerpt:


Set down successively the lesser root [$x_1$], the greater root [$y_1$] and the interpolator [$m_1$].
Place under them, the same or another [triple $x_2, y_2, m_2$], in the same order. From them, by
repeated applications of the bhāvanā, numerous roots can be sought. Therefore, the bhāvanā
is being expounded. ...


The perception of Theorem 1 as a law of composition (i.e., in terms of an
operation like ⊙) comes out in the applications by Brahmagupta and others, in the
version of Theorem 1 due to Bhaskara II and, above all, in the choice and use of the
term bhavana.

In general, the Sanskrit word bhāvanā has several meanings including “production”
and “demonstration”; in the context of mathematics, bhāvanā means
“composition” or “combination”. A similar Sanskrit word bhāvita (usual meaning:
“created”, “produced”; or “future”, “to be”) was adopted in Indian algebra
by Brahmagupta and others to denote the product of different unknown quantities.
The law ⊙ (or ∗) was perceived as a special mathematical operation involving a
special multiplicative principle. In fact, Udayadivakara (1073 CE) explains bhavana
as multiplication ([Shu1, p 6]).

While the precise sense of bhāvanā, in the context of ⊙ (or ∗), is that of a principle
of “composition” (a sort of generalised “product”), the term also carries the
additional suggestion of being a principle of “production” (i.e., “generation”) of
new roots, and of being a powerful “lemma” (i.e., “producer” of new results). The
descriptions of Theorem 1 by Brahmagupta and other Indian algebraists are invariably
followed by results crucially involving applications of the bhāvanā principle ⊙. In
view of the importance of laws like ⊙, an appropriate terminology was chosen
which would be rich in significance, simultaneously conveying the sense of a
multiplicative principle of composition, a principle of generation, a special lemma,
etc.

Theorem 1 can be formulated in the form of the identities 


\[ (y_1^2 - D x_1^2)(y_2^2 - D x_2^2) = (D x_1 x_2 \pm y_1 y_2)^2 - D (x_1 y_2 \pm x_2 y_1)^2 \]


which are sometimes called Brahmagupta's identities. 


⁴⁵ In the light of modern developments, Jayadeva's statement has turned out to be prophetic.


Some Applications of the Bhavana by Brahmagupta


After stating Theorem 1, Brahmagupta gives useful offshoots. First, he indicates
([Sha, Ch 18, verses 64-65]) how to construct infinitely many rational solutions of
$Dx^2 + 1 = y^2$ (to be discussed in the next section). Next he points out ([Sha, Ch 18,
verse 66]) that infinitely many integer solutions of $Dx^2 + m = y^2$ can be generated
from a given solution (if it exists) and a solution of $Dx^2 + 1 = y^2$ (see below for
examples with $D = 13$); in particular, from one integer solution of $Dx^2 + 1 = y^2$,
one can generate infinitely many integer solutions ([Du1, p 90-93]). And then,
Brahmagupta gives his partial solution to the original problem ([Sha, Ch 18, verses
67-68]; [Du1, p 93-95]).


Theorem 2 (Brahmagupta).


(i) If $Dp^2 + 4 = q^2$, then $\left(\tfrac{1}{2} p(q^2 - 1), \tfrac{1}{2} q(q^2 - 3)\right)$ is a solution of $Dx^2 + 1 = y^2$.


(ii) If $Dp^2 - 4 = q^2$, and $r = \tfrac{1}{2}(q^2 + 3)(q^2 + 1)$, then $(pqr, (q^2 + 2)(r - 1))$
is a solution of $Dx^2 + 1 = y^2$.


Let $(p, q; m)^n$ denote $(p, q; m) \odot (p, q; m) \odot \cdots \odot (p, q; m)$ ($n$ times). The results
(i) and (ii) can be obtained by considering the products $(p, q; 4)^3$ and $(p, q; -4)^6$
respectively and making necessary substitutions and simplifications. For instance,
for (i), using the relations $(p, q; 4) \odot (p, q; 4) = (2pq, Dp^2 + q^2; 4^2)$ and $Dp^2 =
q^2 - 4$, and cancelling the common factor 4 from the consequent identity $D(2pq)^2 +
4^2 = (2q^2 - 4)^2$, one gets the triple $(pq, q^2 - 2; 4)$. Next, considering the composition
$(p, q; 4) \odot (pq, q^2 - 2; 4)$ and making adjustments as above, one would arrive at
the triple $\left(\tfrac{1}{2} p(q^2 - 1), \tfrac{1}{2} q(q^2 - 3); 1\right)$.
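
The two formulae of Theorem 2 are easy to check numerically. The sketch below is an illustration under stated assumptions: the function names are invented here, and exact rational arithmetic is used because the halved expressions need not be integers for every admissible pair (p, q).

```python
from fractions import Fraction


def from_plus_four(p, q):
    """Theorem 2(i): from D*p^2 + 4 == q^2 to a solution of D*x^2 + 1 == y^2."""
    half = Fraction(1, 2)
    return half * p * (q * q - 1), half * q * (q * q - 3)


def from_minus_four(p, q):
    """Theorem 2(ii): from D*p^2 - 4 == q^2 to a solution of D*x^2 + 1 == y^2."""
    r = Fraction(1, 2) * (q * q + 3) * (q * q + 1)
    return p * q * r, (q * q + 2) * (r - 1)


def solves_unit_equation(D, x, y):
    return D * x * x + 1 == y * y


# 13*1^2 - 4 = 3^2, so (p, q) = (1, 3) is admissible for D = 13 in case (ii).
x13, y13 = from_minus_four(1, 3)

# 5*1^2 + 4 = 3^2, so (p, q) = (1, 3) is admissible for D = 5 in case (i).
x5, y5 = from_plus_four(1, 3)
```

When the results are whole numbers, as in both sample cases, they are integer solutions; otherwise the formulae still give rational solutions of the same equation.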

The results are illustrated by numerical examples which are usually in the spirit 
of "good exercises" — demanding a degree of ingenuity — rather than mechanical 
applications. We quote below one example ([Sha, Ch 18, verses 71—72]) which 
illustrates the technical power of Theorem 1. 


Illustrative Examples 
Example 1 (Brahmagupta). Solve, in integers, the equation $92x^2 + 1 = y^2$.


Solution. One readily observes that $92 \times 1^2 + 8 = 10^2$. By samāsa-bhāvanā,
$(1, 10; 8) \odot (1, 10; 8) = (20, 192; 8^2)$.


Dividing the consequent identity $92 \times 20^2 + 8^2 = 192^2$ by $8^2$, one obtains the




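triple $\left(\tfrac{5}{2}, 24; 1\right)$. Now

\[ \left(\tfrac{5}{2}, 24; 1\right) \odot \left(\tfrac{5}{2}, 24; 1\right) = (120, 1151; 1), \]

an integer triple. Thus $(120, 1151)$ is a solution of $92x^2 + 1 = y^2$. □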
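
The two steps of this solution can be replayed with exact rational arithmetic. The sketch below is only an illustration; the helper name `samasa` is an assumption, and the scaling by 8 mirrors the division of the identity by 8².

```python
from fractions import Fraction


def samasa(t1, t2, D):
    """Additive composition of triples (x, y, m) with D*x^2 + m == y^2."""
    x1, y1, m1 = t1
    x2, y2, m2 = t2
    return (x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2, m1 * m2)


D = 92

# Step 1: compose (1, 10; 8) with itself.
x, y, m = samasa((1, 10, 8), (1, 10, 8), D)

# Dividing the identity by 8^2 scales the roots by 8 and the interpolator by 64.
scaled = (Fraction(x, 8), Fraction(y, 8), Fraction(m, 64))

# Step 2: compose the rational triple with itself.
solution = samasa(scaled, scaled, D)
```

The intermediate triple is rational, yet the second composition clears the denominator, which is what makes the example work.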

The thrill of Brahmagupta at his discovery can be felt from the phrase he used
after stating the equation $92x^2 + 1 = y^2$: kurvannāvatsarād gaṇakaḥ, “One who
can solve it within a year [is truly a] mathematician.”

Note that in this example $m = 8$ in the initial triple whereas Theorem 2 is a
statement for $m = \pm 4$. Through the example, Brahmagupta conveys to the reader
that, by clever algebraic manipulations, Theorem 1 can be made to cover various
cases where the initial triple is not necessarily of the form $(p, q; \pm 4)$.

Another remarkable example of Brahmagupta ([Sha, Ch 18, verses 71-72]) is the
equation $83x^2 + 1 = y^2$. Applying Theorems 1 and 2 on the identity $83 \times 1^2 - 2 = 9^2$,
one gets the solution $(x, y) = (9, 82)$ from which one can generate a sequence
of successively larger solutions using Theorem 1; one of the solutions would be
$x = 175075291425879$, $y = 1595011813884802$ (see [OR]).

Through this example, Brahmagupta expects the reader to observe that since
$(\pm 2)^2 = 4$, successive applications of Theorems 1 and 2 on an integer identity
$D\alpha^2 \pm 2 = \beta^2$ will give an integer solution to the equation $Dx^2 + 1 = y^2$. A later
mathematician Sripati clearly formulates ([Sih, p 40]; [DS2, p 157]) this immediate
consequence of Brahmagupta’s results: From any positive integer solution of $Dx^2 +
m = y^2$, where $m \in \{-1, \pm 2, \pm 4\}$, one can derive a positive integer solution of
$Dx^2 + 1 = y^2$ by repeated use of the samāsa-bhāvanā.⁴⁶

As an illustration of the principle of generating larger roots of $Dx^2 + m = y^2$
from a given root, Brahmagupta mentions the equations $13x^2 + 300 = y^2$ and
$13x^2 - 27 = y^2$ ([Sha, Ch 18, verses 75]). By inspection, one has solutions $(10, 40)$
and $(6, 21)$ respectively. Also from the relation $13 \times 1^2 - 4 = 3^2$, one gets the
solution $(180, 649)$ of $13x^2 + 1 = y^2$ by Theorem 2. Now taking the composite
of $(10, 40; 300)$ with $(180, 649; 1)$, one gets the larger solution $(13690, 49360)$ of
$13x^2 + 300 = y^2$; further compositions will yield still larger solutions. Similarly,
for $13x^2 - 27 = y^2$, one considers $(6, 21; -27) \odot (180, 649; 1)$, $(6, 21; -27) \odot
(180, 649; 1) \odot (180, 649; 1)$, and so on.
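
The procedure just described, composing a known solution repeatedly with a solution for m = 1, can be sketched as a Python generator. The names `samasa` and `larger_solutions` are assumptions made for this illustration.

```python
from itertools import islice


def samasa(t1, t2, D):
    """Additive composition of triples (x, y, m) with D*x^2 + m == y^2."""
    x1, y1, m1 = t1
    x2, y2, m2 = t2
    return (x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2, m1 * m2)


def larger_solutions(start, unit, D):
    """Yield start ⊙ unit, start ⊙ unit ⊙ unit, ... for a unit triple (x, y; 1)."""
    current = start
    while True:
        current = samasa(current, unit, D)
        yield current


D = 13
unit = (180, 649, 1)
for_300 = list(islice(larger_solutions((10, 40, 300), unit, D), 3))
for_minus_27 = list(islice(larger_solutions((6, 21, -27), unit, D), 3))
```

Since the unit triple has interpolator 1, every composite keeps the interpolator of the starting triple while its roots grow.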

In section 1, we mentioned that it is now known that all positive integer solutions
of $Dx^2 + 1 = y^2$ are given by $(x_n, y_n)$ satisfying $y_n + \sqrt{D}\, x_n = (y_1 + \sqrt{D}\, x_1)^n$,
where $(x_1, y_1)$ is the fundamental solution (i.e., the minimal positive integral solution).
Once the fundamental solution is determined, one can see that the other
solutions $(x_n, y_n)$ are given by Theorem 1. The sequence $\left\{ \frac{y_n}{x_n} \mid n \ge 1 \right\}$ forms a sequence
of “best” rational approximations to $\sqrt{D}$ (the precise meaning will be stated
in section 7).


⁴⁶ This gives another strategy for solving Example 1: it is enough to solve $92x^2 + 4 = y^2$ and hence
$23x^2 + 1 = z^2$; use $23 + 2 = 5^2$.


Some Perspectives on the Bhavana 


We now recall a few points that are further elaborated in the author's article [Du1].


(i) Brahmagupta's very approach to the problem anticipates a modern trend 
(which perhaps began with Lagrange): the quest for general principles, the attempt 
to view a problem as part of a larger set-up. While trying to solve a specific hard 
problem $Dx^2 + 1 = y^2$ (in two variables), Brahmagupta undertook a bold and
farsighted exploration of the general picture: the solution space of $Dx^2 + m = y^2$
(in three variables). In the process Brahmagupta discovers and extracts an important 
general and abstract principle (Theorem 1), and makes a clear enunciation of this 
principle. 

(ii) Brahmagupta envisages the key ingredient of a modern abstract structure: 
binary composition. If we exclude the four elementary arithmetic operations, the 
bhāvanā is perhaps the first conscious construction of a binary composition. Note
that the binary operation is quite a complicated one: it involves two integral triples
of unknown roots.⁴⁷ Thus, in an attempt to solve an indeterminate equation in two
variables, a seventh-century mathematician thought of constructing what amounts
to an intricate abstract structure on the solution space of an equation in three
variables.⁴⁸ One does not see such an approach in mathematics during the next 1000
years. The discovery of an algebraic structure, such as Brahmagupta's composition 
law, on a set of significance, is now an important theme in modern mathematics 
research. 


(iii) The power of the general composition principle of Brahmagupta can be seen 
from the ease with which it immediately provides the solution of a difficult equation 
like $92x^2 + 1 = y^2$. Ironically, this very simplicity tends to cloud the historian's
perception of the true worth of Theorem 1. A casual observer may miss the richness
and magnificence of ideas encapsulated in the 2-step solution of $92x^2 + 1 = y^2$.
Further, once the mathematical community gets accustomed to an original idea 
like the bhāvanā, it becomes all the more difficult to fathom the greatness of the 
discovery. Historians need to be aware of this intrinsic risk of missing the real depth 
and significance of Brahmagupta's work on the varga-prakrti.


⁴⁷ Recall that even basic symbolic computation with unknown roots (treating them as if they
were known quantities) is a fairly modern approach (from late 18th century) that emerged during the
investigations on the general polynomial in one unknown.

⁴⁸ The idea occurs in a fluid amorphous form; it had not been crystallised in a precise set-theoretic
framework: the solution space was not presented as a single set.

(iv) The bhāvanā influenced, directly and indirectly, subsequent research on indeterminate
equations by Indian algebraists. Apart from providing partial solutions
to the original problem, the samāsa-bhāvanā contained the key to the subsequent
complete solution by the cakravala algorithm as we will see in the next section.

(v) Sometimes the brilliance of a particular algebraic research work lies in its
opening up of new and unexpected horizons with immense possibilities through
surprisingly simple innovations. Let us view the samāsa-bhāvanā in this light. Not
only did Theorem 1 influence the complete solution to the original problem, it had
opened up new possibilities whose true potential began to be harnessed only after
its rediscovery in the 18th century. Theorem 1 turned out to be “a theorem of capital
importance” (in the words of Euler⁴⁹). For instance, the samāsa-bhāvanā ⊙ has an
elegant interpretation in terms of the norm function, a very important concept
in modern mathematics. Let $A = \mathbb{Z}[\sqrt{D}] = \{ a + b\sqrt{D} \mid a, b \in \mathbb{Z} \}$, where $\mathbb{Z}$
denotes the set of integers. The norm function on $A$ is the map $N : A \to \mathbb{Z}$ defined
by


\[ N(y + x\sqrt{D}) = (y + x\sqrt{D})(y - x\sqrt{D}) = y^2 - D x^2. \]

Brahmagupta's identity may be reformulated as the statement:

The norm function $N$ is multiplicative, i.e., $N(\alpha\beta) = N(\alpha)N(\beta)$ for all $\alpha, \beta \in A$.


Now the solution space $S := \{ (x, y, m) \in \mathbb{Z} \times \mathbb{Z} \times \mathbb{Z} \mid Dx^2 + m = y^2 \}$ is precisely the
graph of $N$ and we have a bijection $f : S \to A$ defined by $f(x, y, m) = y + x\sqrt{D}$.
Transferring the ring multiplication on $A$ to $S$ via $f^{-1}$, we get the samāsa-bhāvanā.
Thus the exotic multiplicative structure ⊙ on $S$ actually corresponds to the natural
ring multiplication on $A$.

It is tempting to surmise that Brahmagupta discovered Theorem 1 through an
algebraic manipulation which was, in essence, the verification of the structure-preserving
property of the norm function, with the natural multiplication in $A$ providing
the precise formula (see [Du1, p 99-101]).
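
The correspondence between ⊙ and multiplication in Z[√D] can be made concrete with a small Python class. Everything named here (`QuadInt`, `f`, `f_inverse`) is an illustrative assumption; the sketch only checks the multiplicativity of the norm and the transfer of structure on sample values.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class QuadInt:
    """The element y + x*sqrt(D) of Z[sqrt(D)], with integer x and y."""
    y: int
    x: int
    D: int

    def __mul__(self, other):
        # (y1 + x1 sqrt(D)) (y2 + x2 sqrt(D)), expanded using sqrt(D)^2 = D
        return QuadInt(self.y * other.y + self.D * self.x * other.x,
                       self.x * other.y + other.x * self.y,
                       self.D)

    def norm(self):
        return self.y * self.y - self.D * self.x * self.x


def f(triple, D):
    """Map a triple (x, y, m) to y + x*sqrt(D)."""
    x, y, _ = triple
    return QuadInt(y, x, D)


def f_inverse(alpha):
    """Map y + x*sqrt(D) back to the triple (x, y, N(alpha))."""
    return (alpha.x, alpha.y, alpha.norm())


D = 92
alpha, beta = QuadInt(10, 1, D), QuadInt(9, 1, D)
product_norm = (alpha * beta).norm()

# Transferring multiplication back to triples reproduces the additive composition.
composed = f_inverse(f((1, 10, 8), D) * f((1, 10, 8), D))
```

The last line reproduces the first step of Example 1, since the product in the ring has exactly the components given by the samāsa-bhāvanā.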


(vi) In the language of quadratic forms, Theorem 1 says that two binary quadratic
forms with a given discriminant $D$ (say $y^2 - Dx^2$, $v^2 - Du^2$) can be composed⁵⁰
to yield another such form with discriminant $D$ in a new pair of variables $(xv + yu, yv + Dxu)$.


⁴⁹ Euler highlighted the result as theorema eximium; see ([We2, p 204-205]).

⁵⁰ The theory of composition of quadratic forms, an important and rich topic initiated by Gauss
and Dirichlet, is still an active area of research. In recent years, Manjul Bhargava discovered higher
composition laws, a work which has many important ramifications. An account is given in Bhargava's
article Higher composition laws and applications in the Proceedings of International Congress of
Mathematicians (ICM), Madrid (2006).

Theorem 1 can also be viewed as the result (see [Du1, p 83-84] for
details):


The binary form $y^2 - Dx^2$ is strongly multiplicative (over rationals).


This version of Theorem 1 was generalised in 1965 by A. Pfister using what are now
called “Pfister forms”. Pfister's discovery opened up new directions in the theory
of quadratic forms. Theorem 1 thus happens to be a starting point of a distinct area
of research in algebra. In his 1990 research monograph [Oj] in this field, Manuel
Ojanguren begins Chapter 5 by quoting Brahmagupta's original Sanskrit verses
describing Theorem 1; the chapter itself is titled Also sprach Brahmagupta⁵¹.

(vii) While it is exciting to dwell on the implicit occurrence of modern principles 
in an ancient text, let us not lose sight of one aspect of the section Varga-prakrti: the
sheer wizardry in classical algebraic manipulation at an early stage in the history of 
symbolic algebra. 

(viii) As argued in section 3 of our earlier paper ([Du1]), Brahmagupta's treatment
of varga-prakrti has a unique pedagogic potential. Theorem 1 defines an
intricate binary operation ⊙ on $S = \{ (x, y, m) \in \mathbb{Z} \times \mathbb{Z} \times \mathbb{Z} \mid Dx^2 + m = y^2 \}$.
This sophisticated idea of constructing a binary composition on an abstractly defined
unknown set is the quintessence of modern “abstract algebra”. Theorem 1
and Example 1 can be effectively used to convey to a student the power of the concept
“binary composition” and help him appreciate the essence of abstract algebraic
ideas. It could be inspiring for the student to realise that the simplicity of the solution
to Example 1 has its secret in the monoid structure of $(S, \odot)$ which in turn has its
root in the natural ring structure of the set $A$ (defined above in (v)). As we mentioned
in [Du1, p 88], “the unexpected invocation of an abstract algebraic technique for 
a concrete problem by an ancient mathematician could infuse in him [a student] a 
dynamic and creative vigour in his formal study of abstract algebra. Thus, through 
this deep but easily accessible work of a Master, the perceptive student can get 
an early exposure to modern sophistication, see a natural application of abstract 
algebraic principles, develop a flair for exploring worthwhile generalisation, and 
truly imbibe the spirit of the abstract algebra culture without getting swept away by 
its formalism.” 


(ix) Due to the paucity of adequate historical materials, it is often difficult to
ascertain whether a particular result in a treatise was discovered by the author or by
a predecessor. One does not know who was the true discoverer of the cakravāla;
one cannot even be sure whether Aryabhata was the pioneer of kuttaka. But the
bhāvanā has the unmistakable stamp of Brahmagupta’s genius. One does not see
any seed of this idea in the treatises of his predecessors Aryabhata and Bhaskara
I or in the Bakhshali manuscript. None of the results in these remarkable texts come
close to the algebraic sophistication shown by Brahmagupta's research on varga-prakrti.
In several other results of Brahmagupta, one discerns a similar abrupt leap
in mathematical maturity.


⁵¹ In English: Thus Spake Brahmagupta.


7. Cakravala 


In the Laghu-Bhaskariya (Ch 8; verse 18) of Bhaskara I, there is a problem from
astronomy involving the simultaneous quadratic indeterminate equations $8x + 1 =
y^2$ and $7y^2 + 1 = z^2$ ([Shu3, p 101]). By inspection, one can quickly arrive at the
integral solution $y = 3$, $z = 8$ for the second equation. In the context of this special
equation, Udayadivakara, in his commentary on the Laghu-Bhaskariya, refers to
Jayadeva's general cakravāla method for solving any equation $Dx^2 + 1 = y^2$.
Jayadeva's verses on the cakravāla algorithm have been published in ([Shu1]); a
portion has been reproduced in the Appendix. Bhaskara II's verses on this algorithm
occur in Bijaganita ([Ab, p 23]; [P, p 76]); they have been quoted in M.S. Sriram's
article [Srr, p 167].


The Algorithm 


Recall that $(p, q; m)$ denotes a triple satisfying $Dp^2 + m = q^2$. The underlying
idea of the algorithm can then be put as follows: from an integer triple $(p_n, q_n; m_n)$
such that $|m_n|$ is “small”, one has to construct an integer triple $(p_{n+1}, q_{n+1}; m_{n+1})$
with $|m_{n+1}|$ “small”, eventually arriving at an integer triple of the form $(p, q; 1)$. It
will be convenient for our discussions to assume that, at each stage, the integers
$p_n$ and $m_n$ are mutually coprime. Observe that, if, in an integer triple $(p, q; m)$, the
integers $p$ and $m$ have a GCD $g$, then the relation $Dp^2 + m = q^2$ will show that $g$ is
a factor of $q$ and $g^2$ a factor of $m$ and hence $\left(\frac{p}{g}, \frac{q}{g}; \frac{m}{g^2}\right)$ will also be an integer triple.
Thus, replacing $(p, q; m)$ by $\left(\frac{p}{g}, \frac{q}{g}; \frac{m}{g^2}\right)$ if necessary, we may ensure that $p$ and $m$
are relatively prime.

Integer triples of the type $(1, y; y^2 - D)$ were already “in the air” among Indian
algebraists who expounded on Brahmagupta's work.⁵² The authors of cakravāla
invariably begin with an initial triple $(p_0, q_0; m_0)$ of this type: $p_0 = 1$, $q_0$ a positive
integer for which $|q_0^2 - D|$ is minimum, and $m_0 = q_0^2 - D$.


⁵² Sripati (and successive algebraists) clearly explains (what is tersely indicated by Brahmagupta)
that applying bhāvanā on two copies of any triple $(1, q; q^2 - D)$, one gets the rational solution
$x = \frac{2q}{q^2 - D}$, $y = \frac{q^2 + D}{q^2 - D}$ of $Dx^2 + 1 = y^2$ and that infinitely many roots of this equation can be obtained
by varying $q$ and also by repeating the bhāvanā ([Sih, p 40]; [DS2, p 153;150]). The possible impact of
this observation on the discovery of the cakravāla is discussed in [Du1, p 90-92; 96-97].

We now explain the inductive step. Suppose that one has completed $n$ steps and
arrived at an integer triple $(p_n, q_n; m_n)$ with $p_n$ and $m_n$ coprime. Composing this
(known) integer triple $(p_n, q_n; m_n)$ with the (variable) integer triple $(1, y; y^2 - D)$
by samāsa-bhāvanā, one gets an integer triple $(p_n y + q_n, D p_n + q_n y; m_n(y^2 - D))$,
that is, the identity


\[ D(p_n y + q_n)^2 + m_n(y^2 - D) = (D p_n + q_n y)^2. \]


Dividing the above identity by $m_n^2$ (an idea which too could have arisen from the
methods for obtaining rational solutions), one obtains a rational triple


\[ (p_{n+1}, q_{n+1}; m_{n+1}) = \left( \frac{p_n y + q_n}{m_n}, \frac{D p_n + q_n y}{m_n}; \frac{y^2 - D}{m_n} \right). \]


Now the identity $q_n^2 = D p_n^2 + m_n$ shows that


\[ m_n p_{n+1} q_n = (p_n y + q_n) q_n = p_n (D p_n + q_n y) + m_n. \]


Suppose that $p_{n+1}$ is an integer. As $p_n$ and $m_n$ are coprime, it would then follow
that $m_n$ divides $D p_n + q_n y$, that is, $q_{n+1}$ is an integer. It also follows that $m_{n+1}$ ($=
q_{n+1}^2 - D p_{n+1}^2$) is an integer. Further, one has the relation $p_{n+1} q_n - p_n q_{n+1} = 1$
which shows, by the argument made at the outset, that $p_{n+1}$ and $m_{n+1}$ are
coprime.

Thus $(p_{n+1}, q_{n+1}; m_{n+1})$ would be an integer triple if $y$ can be so chosen that
$p_{n+1}$ (and hence $q_{n+1}$, $m_{n+1}$) becomes an integer. This amounts to finding integer
solutions of the equation $m_n x - p_n y = q_n$. As we saw in section 5, the linear
indeterminate equation had been extensively discussed by Indian algebraists from
the time of Aryabhata (499 CE) and they knew the complete solution to the problem.
Now among the infinitely many solutions, Jayadeva and Bhaskara II choose a
solution for which $|y^2 - D|$ would be minimum so that $|m_{n+1}|$ is minimised (as
desired).

It can be shown that the process terminates after a finite number of steps (that
is, $m_n = 1$ for some $n$), but the proof does not occur in any ancient text. Note
that it is enough to arrive at a stage in which $m_n \in \{\pm 1, \pm 2, \pm 4\}$; for, in that
case, one can use Brahmagupta's formulae (Theorem 2) to arrive at a solution of
$Dx^2 + 1 = y^2$. The ancient Indian authors assert that one will always arrive at such a
stage.

Starting with the initial triple $(p_0, q_0; m_0)$, the cakravāla (with or without the
bhāvanā) invariably leads one to the fundamental solution of $Dx^2 + 1 = y^2$
mentioned in section 1 ([Sel, p 177]) and, as mentioned earlier, repeated applications
of Theorem 1 then yield all integer solutions.

Quite often Brahmagupta’s result gives a very effective short-cut for numerical
computations, as the solution of the famous example of Bhaskara II ([Ab, p 24]; [P,
p 77]) shows:

Example 2 (Bhaskara II). Solve, in positive integers, $61x^2 + 1 = y^2$.

Solution. As 64 is the perfect square nearest to 61, we have the initial triple $(1, 8; 3)$.
Now one finds a positive integer $y$ for which $\frac{y + 8}{3}$ is an integer and $|y^2 - 61|$ is
minimised. Clearly $y = 7$. Now

\[ \frac{7 + 8}{3} = 5, \quad \frac{61 + 8 \times 7}{3} = 39, \quad \frac{7^2 - 61}{3} = -4. \]

Thus we have the second triple $(5, 39; -4)$. Now rather than continuing the
cakravāla, one can apply Brahmagupta’s formula (Theorem 2) on the triple
$(5, 39; -4)$ to immediately get the minimum positive integral solution $(226153980,
1766319049)$ of $61x^2 + 1 = y^2$. □

The article ([OR]) works out solutions to the examples of Narayana using the
cakravāla aided by the bhāvanā. The minimum positive integral solution for $103x^2 +
1 = y^2$ is $(22419, 227528)$ and for $97x^2 + 1 = y^2$ is $(6377352, 62809633)$.
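
The inductive step described above translates into a short loop. The Python sketch below is a plain reading of that step under stated assumptions: it always runs the cycle until m = 1 (without the Theorem 2 short-cut), it finds y by a simple search rather than by the kuttaka, and it takes absolute values when dividing so that the roots stay positive. The function names are invented for this illustration.

```python
from math import isqrt


def choose_y(p, q, m, D):
    """Positive y with m dividing p*y + q that minimises |y*y - D|."""
    a = abs(m)
    r = isqrt(D)
    lo = r  # search downward from floor(sqrt(D))
    while lo > 0 and (p * lo + q) % a != 0:
        lo -= 1
    hi = r + 1  # search upward; this ends because p and m are coprime
    while (p * hi + q) % a != 0:
        hi += 1
    if lo > 0 and D - lo * lo <= hi * hi - D:
        return lo
    return hi


def cakravala(D):
    """Return (x, y) with D*x*x + 1 == y*y for a non-square integer D > 1."""
    r = isqrt(D)
    if r * r == D:
        raise ValueError("D must not be a perfect square")
    # Initial triple (1, q; q*q - D) with |q*q - D| as small as possible.
    q = r if D - r * r <= (r + 1) ** 2 - D else r + 1
    p, m = 1, q * q - D
    while m != 1:
        y = choose_y(p, q, m, D)
        a = abs(m)
        # Compose with (1, y; y*y - D) and divide the identity by m*m.
        p, q, m = (p * y + q) // a, (D * p + q * y) // a, (y * y - D) // m
    return p, q


x61, y61 = cakravala(61)
```

For D = 61 the first pass of the loop reproduces the step from (1, 8; 3) to (5, 39; -4) worked out in Example 2.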


Application to Rational Approximation 


Narayana applies the cakravāla-bhāvanā method of generating arbitrarily large
solutions of $Dx^2 + 1 = y^2$ to determine progressively better rational approximations
of $\sqrt{D}$ ([D1]; [Du1, p 93; 102-3]). Note that, if $Da^2 + 1 = b^2$, then


\[ \frac{b}{a} - \sqrt{D} = \frac{b^2 - Da^2}{a(b + a\sqrt{D})} = \frac{1}{a(b + a\sqrt{D})} \]

and thus, for a sufficiently large solution $(a, b)$, $\frac{b}{a}$ will be a good approximation for
$\sqrt{D}$. Narayana cites two numerical examples, the first being $\sqrt{10}$. We discuss this one;
the other case is similar.

For $\sqrt{10}$, Narayana mentions the rational approximations $\frac{19}{6}$, $\frac{721}{228}$ and $\frac{27379}{8658}$. To
see how the fractions arise, consider the notation of section 6 for $D = 10$. Since 9
is the perfect square nearest to 10, one has the initial triple $(1, 3; -1)$. Now,


\[ (1, 3; -1) \odot (1, 3; -1) = (6, 19; 1), \]
\[ (6, 19; 1) \odot (6, 19; 1) = (228, 721; 1), \]
\[ (6, 19; 1) \odot (228, 721; 1) = (8658, 27379; 1). \]

Thus one has the three successive fractions $\frac{19}{6}$, $\frac{721}{228}$ and $\frac{27379}{8658}$ as rational approximations
to $\sqrt{10}$. The usefulness of the method can be seen from the fact that the third
fraction $\frac{27379}{8658}$ ($= 3.162277662\ldots$) matches the value of $\sqrt{10}$ ($= 3.162277660\ldots$)
up to nine decimal digits. This method of generating successively closer approximations
was restated by Euler in 1732 ([D1, p 188]). It is now known that if
$\{(x_n, y_n) \mid n \ge 1\}$ is the sequence of positive integral solutions of $Dx^2 + 1 = y^2$,
then, among all fractions with denominator $x_n$ or less, $\frac{y_n}{x_n}$ is the best approximation
to $\sqrt{D}$. This follows from a basic theorem in continued fractions ([BrC, p 399-401;
535]; [G, p 63-64; 71-72]). In particular, Narayana’s approximations for $\sqrt{10}$ are
the best possible (with the respective restrictions on the denominators). One can
similarly see that the Sulba approximations for $\sqrt{2}$, mentioned in section 1, are
again the best possible. For the equation $2x^2 + 1 = y^2$, the minimum positive
integral solution is clearly $(2, 3)$. Now the bhāvanā relations


\[ (2, 3; 1) \odot (2, 3; 1) = (12, 17; 1) \quad \text{and} \quad (12, 17; 1) \odot (12, 17; 1) = (408, 577; 1) \]


yield the two Sulba fractions $\frac{17}{12}$ and $\frac{577}{408}$. It can also be seen, directly or from the
theory of continued fractions, that the Sulba fraction $\frac{7}{5}$ is the best approximation to
$\sqrt{2}$ among all fractions with denominator $\le 5$.
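
The fractions above, and the best-approximation claims for small denominators, can be checked with a brief Python sketch. The helper names are assumptions, and the brute-force search uses floating-point square roots, which is adequate only for the small denominators tried here.

```python
from fractions import Fraction
from math import sqrt


def samasa(t1, t2, D):
    """Additive composition of triples (x, y, m) with D*x^2 + m == y^2."""
    x1, y1, m1 = t1
    x2, y2, m2 = t2
    return (x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2, m1 * m2)


def approximations(D, unit, count):
    """Fractions y/x from the first `count` powers of a triple (x, y; 1)."""
    fractions_found, current = [], unit
    for _ in range(count):
        fractions_found.append(Fraction(current[1], current[0]))
        current = samasa(current, unit, D)
    return fractions_found


def best_with_bounded_denominator(D, max_den):
    """Brute-force search for the fraction nearest sqrt(D) with denominator <= max_den."""
    root = sqrt(D)
    candidates = (Fraction(round(root * q), q) for q in range(1, max_den + 1))
    return min(candidates, key=lambda frac: abs(float(frac) - root))


sqrt10_fractions = approximations(10, (6, 19, 1), 3)
errors = [abs(float(frac) - sqrt(10)) for frac in sqrt10_fractions]
```

The errors shrink rapidly, in line with the bound 1/(a(b + a√D)) derived above.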


Some Perspectives on the Cakravala 


We now make a few observations. 


(i) The cakravala algorithm demonstrates a marvellous interplay between the
two great preceding works, the kuttaka and the bhāvanā. As Weil observes ([We2,
p 22]):


As is the case with many brilliant discoveries, this one [cakravala] can be seen in retrospect 
as deriving quite naturally from the earlier work [samasa-bhavana]. 


The pedagogic potential of this development can hardly be overstated. A student can
be shown an “illustrious example”⁵³ of how first-rate research flows: how Aryabhata
seizes upon an ancient algorithm for computing the GCD to make a “descent” into 
the kuttaka algorithm for solving the linear Diophantine equation, a problem with 
applications in astronomy; how Brahmagupta approaches the harder problem of 
solving the Pell equation and comes up with a brilliant bhāvanā yielding a partial 
solution to the problem, and how Jayadeva (or a predecessor) assimilates both the 
ideas to arrive at the astonishing cakravala algorithm. 


53 The allusion is to a statement of Leibniz quoted by Weil ([We, p 229]; [Du1, p 88]).
(ii) Expositions on the cakravāla often tend to identify it with the Brouncker-Euler-Lagrange
procedure. But there is a difference. In the notation used above, the
Brouncker-Euler-Lagrange method would amount to choosing quantities $y < \sqrt{D}$
for which $D - y^2$ is minimised (subject to conditions above), while the cakravāla
of Jayadeva and Bhaskara II stipulates that $|D - y^2|$ is minimised.⁵⁴ The steps in the
former method correspond to certain convergents of the standard continued fraction
expansion of $\sqrt{D}$; the steps in the latter correspond to a generalised expansion
where negative remainders could be involved. For instance, for the case $D = 61$
$(= 7^2 + 12 = 8^2 - 3)$, the starting point in the former procedure will be 7, while in the
latter it will be 8.

The cakravāla can also be interpreted in the framework of the regular (i.e., the
usual) continued fraction expansion of $\sqrt{D}$ (see [Sig]); its steps correspond to fewer
convergents of the expansion than the steps of Brouncker's method. For instance,
the first three convergents of the continued fraction expansion of $\sqrt{61}$ are $\frac{7}{1}$, $\frac{8}{1}$ and $\frac{39}{5}$
(with $\frac{7}{1} < \frac{39}{5} < \sqrt{61} < \frac{8}{1}$); the cakravāla involves only $\frac{8}{1}$ and $\frac{39}{5}$. As mentioned
in section 5, there are hints that ancient Indian algebraists had possibly
discovered principles of continued fractions. The term cakravāla (cyclic) could be
an allusion to the periodicity of a continued fraction expansion of $\sqrt{D}$ (see [Sr1, p
286]) or some equivalent concept.
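
For readers who wish to see the regular expansion concretely, here is a minimal Python sketch that lists the first few convergents of $\sqrt{D}$. It uses the usual textbook recurrence for the continued fraction of a quadratic surd; the function name `sqrt_convergents` and its parameters are hypothetical, and $D$ is assumed to be a positive non-square integer.

```python
from math import isqrt

def sqrt_convergents(D, count):
    """Return the first `count` convergents (numerator, denominator) of sqrt(D).

    Assumes D is a positive integer that is not a perfect square.
    """
    a0 = isqrt(D)
    m, d, a = 0, 1, a0            # state of the standard surd recurrence
    h_prev, h = 1, a0             # numerators of successive convergents
    k_prev, k = 0, 1              # denominators of successive convergents
    out = [(h, k)]
    while len(out) < count:
        m = d * a - m
        d = (D - m * m) // d
        a = (a0 + m) // d         # next partial quotient
        h_prev, h = h, a * h + h_prev
        k_prev, k = k, a * k + k_prev
        out.append((h, k))
    return out

print(sqrt_convergents(61, 3))    # convergents 7/1, 8/1, 39/5
```

For $D = 61$ the second and third convergents, 8/1 and 39/5, agree with the first two cakravāla triples $(1, 8; 3)$ and $(5, 39; -4)$.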

(iii) The descriptions of cakravāla by Jayadeva and Bhaskara II begin with
the inductive step of constructing $(p_{n+1}, q_{n+1}; m_{n+1})$ from $(p_n, q_n; m_n)$ without
mentioning any starting point $(p_0, q_0; m_0)$. Possibly the authors of cakravāla were
flexible regarding the choice of the initial triple $(p_0, q_0; m_0)$; any three convenient
integers $p, q, m$ satisfying $Dp^2 + m = q^2$ could be the starting point.⁵⁵ They
of course had an obvious choice for the initial triple, that is, a triple of the form
$(1, q; q^2 - D)$ which had already become common in the study of varga-prakrti. In
the illustrative examples, they invariably display the triple $(1, q_0; q_0^2 - D)$ where
$q_0$ is an integer $y$ for which $|y^2 - D|$ is minimum. As triples of the form $(1, y; y^2 - D)$
were highlighted, in any case, as a central tool in the cakravāla verses, a
separate mention of $(1, q_0; q_0^2 - D)$ was perhaps felt unnecessary.⁵⁶ Incidentally
the triple $(1, q; q^2 - D)$ finds explicit mention in Jayadeva's verse on the solution
of $Dx^2 + m = y^2$ occurring immediately after his verses on $Dx^2 + 1 = y^2$
([Shu1, p 14]).

Perhaps some of the writers of books on History of Mathematics, who erroneously
state that the cakravāla involves “trial-and-error”, get confused by the


54 Narayana appears to differ slightly on this point. While he is silent on the minimisation in his
description of the cakravāla, in his examples, he chooses $y$ for which $|y - \sqrt{D}|$ is minimised: see [Sig].
55 Recall that in the kuttaka too there was a flexibility of stopping at some stage $k$ (rather than
completing the long division for computation of the GCD) if one could see, by inspection, the solution
of some intermediary linear equation.
56 Recall the culture of brevity in ancient Indian treatises discussed at the beginning of section 5.
apparent obscurity (in the translations of the verses) regarding the initial triple.
They do not realize that triples $(1, q; q^2 - D)$ were taken for granted by the ancient
authors of cakravala.⁵⁷


(iv) As observed by A.A. Krishnaswamy Ayyangar ([Kr1, p 234]), in retrospect,
the original cakravāla can be simplified by avoiding the kuttaka. Denote by $y_{n+1}$
the integer $y$ which satisfies $m_n p_{n+1} - p_n y_{n+1} = q_n$. Thus

$$p_n = \frac{p_{n-1} y_n + q_{n-1}}{m_{n-1}}, \qquad q_n = \frac{D p_{n-1} + q_{n-1} y_n}{m_{n-1}}, \qquad m_n = \frac{y_n^2 - D}{m_{n-1}},$$

so that $p_n y_n - q_n = p_{n-1} m_n$ and hence

$$p_{n+1} = \frac{p_n y_{n+1} + q_n}{m_n} = \frac{p_n (y_{n+1} + y_n)}{m_n} - p_{n-1};$$

in particular, $y_{n+1} + y_n \equiv 0 \pmod{m_n}$. Hence, instead of finding $p_{n+1}$ by kuttaka,
one can construct $y_{n+1}$ inductively as follows: start with $p_0 = 1$, $y_0 = q_0$ a
positive integer $y$ for which $|y^2 - D|$ is minimum, and $m_0 = y_0^2 - D$; having
constructed $y_n, p_n, q_n, m_n$ with $p_n, q_n, m_n$ being mutually coprime, choose $y_{n+1}$ to
be a positive integer $y$ satisfying the simple congruence $y \equiv -y_n \pmod{m_n}$ for
which $|y^2 - D|$ is minimal. Then $p_{n+1} := \frac{p_n (y_{n+1} + y_n)}{m_n} - p_{n-1}$ is an integer; and, as
before, the corresponding $q_{n+1}$, $m_{n+1}$ will also be integers with $p_{n+1}$, $q_{n+1}$, $m_{n+1}$
being mutually coprime.
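
The inductive recipe just described can be sketched in Python. This is an illustration under stated assumptions, not a rendering of the original verses: the function name `cakravala` is hypothetical, $p$ and $q$ are kept positive by dividing by $|m_n|$ (which leaves $Dp^2 + m = q^2$ intact), and a tie between two candidates for $y_{n+1}$ is broken in favour of the smaller one.

```python
from math import isqrt

def cakravala(D):
    """Return positive integers (x, y) with D*x*x + 1 == y*y (D > 1, not a square)."""
    r = isqrt(D)
    if r * r == D:
        raise ValueError("D must not be a perfect square")
    # Initial triple (1, y0; y0^2 - D), with y0 chosen so that |y0^2 - D| is minimal.
    y = r if D - r * r <= (r + 1) ** 2 - D else r + 1
    p, q, m = 1, y, y * y - D
    while m != 1:
        a = abs(m)
        # Candidates t with t = -y (mod |m|): the nearest ones on either side of sqrt(D).
        low = r - (r + y) % a
        high = low + a
        if low > 0 and D - low * low <= high * high - D:
            y_next = low
        else:
            y_next = high
        # Compose with (1, y_next; y_next^2 - D) and divide through by m.
        p, q = (p * y_next + q) // a, (D * p + q * y_next) // a
        m = (y_next * y_next - D) // m
        y = y_next
    return p, q

x, y = cakravala(61)
assert 61 * x * x + 1 == y * y
print(x, y)
```

Only a congruence and a comparison are needed at each step; no kuttaka is invoked, which is the point of the simplification.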

Simplifications are observed after initial breakthroughs. But, as mentioned ear- 
lier, there is a certain richness of thought in the ingenious approach through the 
blending of kuttaka and bhavana which would be good for students to imbibe. For 
the purpose of promoting the art of discovery, it is desirable to highlight the initial 
thought-process which actually achieved the breakthrough and which tends to get 
camouflaged in later simplifications. 


(v) As pointed out in [Kr1], the inductive step in the cakravala algorithm involves 
a transformation from one quadratic form to an equivalent quadratic form. 


(vi) Weil makes an incidental remark ([We2, p 51]): 


the Indians' love for large numbers may have supplied a motivation for their treatment of 
Pell's equation.⁵⁸


57 Another source for the confusion could be that both Jayadeva and Bhaskara II first describe
Brahmagupta's bhāvanā method which is a statement about generating new solutions out of given
solutions without saying how to get a given solution.

58 It is significant that this observation has been made by Weil who had considerable familiarity with
ancient Indian literature.
Indeed, Indians had a fascination for large numbers from the Vedic times.⁵⁹ The
facility with large numbers was conducive for the Indians in their research on the 
Pellian equation, which soon involves large solutions. 


(vii) The early occurrence of Pell’s equation in Indian history can also be seen 
in the light of an observation of Legendre ([G, p 3]): 


... it appears that the ancient philosophers⁶⁰ did rather extensive research on the properties
of numbers. But they lacked two tools needed to reach the depths of this science: the art of 
numeration, which serves to express numbers with great ease, and algebra, which generalizes 
results and can operate equally on the known and the unknown. The invention of both these 
arts greatly influenced the progress of the science of numbers. 


Greeks had generally neglected computational mathematics and it is significant 
that Archimedes, who had suggested a problem which was later seen to involve 
Pell's equation, also happened to be the one Greek mathematician who had re- 
alised the importance of an efficient system of enumeration and had tried to evolve 
one. 

While the “art of numeration” had been perfected in India well before the 
time of Aryabhata, the Indian algebraists from Brahmagupta onwards had also 
become adept in the art of formation and clearance of equations. Brahmagupta 
used the term samikarana (making equal) for equation, varna (colour-names) for 
unknowns and rūpa (appearance) for coefficients, described a plan for writing equations
and then preparing them, and classified equations. Further details are given
in [DS2]. 

(viii) Perhaps it can never be ascertained whether ancient Indian algebraists had 
conceived of a proof⁶¹ that the cakravala method always works. Even if they had a
proof, given the technicalities involved, it would have been difficult to express it in 
those days. After all, even Fermat had not written (or is not known to have written) a 
proof, although he claimed that he had a proof by “descent”. While analyzing what 
might have been Fermat's proof, Weil provides a valuable insight into Fermat's 
possible difficulty in formulating it ([We2, p 99]):


It does not seem unreasonable to assume that this may have been, in general outline, what 
Fermat had in mind when he spoke of his proofs in his letters to Pascal and Huygens; how 
much of it he could have made explicit must remain a moot question. Perhaps one of his 


59 A verse of Medhātithi in the Vajasaneyi Samhita (XVII.2) of the Sukla Yajurveda gives number-names
for each power of ten up to $10^{12}$; much larger numbers are given in the epic Ramayana and the
Jaina and Buddhist literature. For a few examples, see R.C. Gupta, World's longest lists of decuple terms,
Ganita Bharati Vol 23 (2001), p 83-90.

60 Legendre was referring to ancient Greeks; he was not aware of ancient Indian research in number
theory.

61 Ancient Indians had the notion of upapatti which corresponds to “proof”; see the article by M.D. 
Srinivas ([Srn]) for a comparison of the two concepts. 
most serious handicaps was the lack of the subscript notation; this was introduced later by
Leibniz, in still imperfect form, and its use did not become general until rather late in the 
next century. Faced with a difficult or complicated proof, Fermat might well be content with 
a careful analysis of some typical numerical cases, convincing himself at the same time that 
the steps involved were of general validity. Actually he never succeeded in writing down his 
proofs. 


Now the handicaps of Fermat applied all the more to Jayadeva, Bhaskara II and 
their predecessors who had no better writing-aid than perishable palm-leaves. Also 
note that the theoretical justification for cakravāla is more complicated. Besides,
as mentioned at the beginning of section 5, the great Ācāryas (Masters) in ancient
Indian tradition usually stated only the essence of their results (and that too in verse
form). Details were left for oral transmission; but the unfortunate break in oral
tradition, brought about by foreign invasions, resulted in the loss of significant links
in mathematical and scientific knowledge. But the very nature of the cakravāla (and
the term itself) suggests that, like Euler, the Indians must have had a deep insight 
into the problem if not a complete proof. 


The history of ancient Indian algebra, especially that of the kuttaka, the bhavana 
and the cakravala, is a grand illustration of the following remark of Sri Aurobindo 
([A, p 185]): 


Especially in mathematics, astronomy and chemistry, the chief elements of ancient science, 
she [India] discovered and formulated much and well and anticipated by force of reasoning 
or experiment some of the scientific ideas and discoveries which Europe first arrived at much 
later, but was able to base more firmly by her new and completer method. 


8. Brahmagupta as a representative of The Classical Age 


The kuttaka, the bhāvanā and the cakravala are fruits of the Classical Age⁶² in Indian
history, a period of intellectual flourish, a period of a magnificent efflorescence in
myriad fields of human endeavour. To a thinker who is aware of the spirit of the
times, the Indian achievements on algebra in general, and quadratic indeterminate
equations in particular, do not come as a surprise.⁶³ Thus a familiarity with the broad
features of the cultural history of ancient India is desirable for fresh thinking on any 
topic in ancient Indian science. A deep insight into the history can be had from Sri 
Aurobindo's essays in [A]. 


62 The “Classical Age” usually refers to the Gupta period (beginning from 320 CE), the period often
identified as the “Golden Age” of post-Vedic Indian history. Many features of the Classical Age can be 
seen in the history of the preceding centuries, from the time of Mauryas (from around 300 BCE) at least. 

63 Unfortunately, there have been scholars from both India and the West who have nurtured the image
of the ancient Indian mind as being exclusively other-worldly. 
The early epoch of the Veda and the Upanisads, luminous with the discovery of
the Spirit, was the “heroic creative seed-time” ([A, p 110]), the first formative
period of the deeper and finer aspects of the Indian civilisation. This inspired period
of great spiritual outflowering passed away into an age of strong intellectuality. In
the words of Sri Aurobindo ([A, p 148]):


The second or post-Vedic age of Indian civilisation was distinguished by the rise of 
the great philosophies, by a copious, vivid, many-thoughted, many-sided epic literature,
by the beginnings of art and science, by the evolution of vigorous and complex society, by 
the formation of large kingdoms and empires, by manifold formative activities of all kinds 
and great systems of living and thinking. Here as elsewhere, in Greece, Rome, Persia, China, 
this was the age of a high outburst of the intelligence working upon life and the things of the 
mind... 


It was a birth time and youth of the seeking intellect. . . . 


In the Classical Age, each department of knowledge or activity, once taken up, 
was pursued to its extreme with an amazing intensity, a powerful, penetrating and 
scrupulous intelligence. Sri Aurobindo explains ([A, p 402-403]):


For the third power of the ancient Indian spirit was a strong intellectuality, at once austere 
and rich, robust and minute, powerful and delicate, massive in principle and curious in 
detail. ... 

The mere mass of the intellectual production during the period from Asoka well into the 
Mahomedan epoch is something truly prodigious, as can be seen at once if one studies the 
account which recent scholarship gives of it, and we must remember that that scholarship as 
yet only deals with a fraction of what is still lying extant and what is extant is only a small 
percentage of what was once written and known... . 


In each subject from the largest and most momentous to the smallest and most trivial there 
was expended the same all-embracing, opulent, minute and thorough intellectuality . . . 


Brahmagupta is a striking embodiment of this vibrant intellectuality. This ver- 
satile genius systematises arithmetic, lays the foundation of algebra, takes the 
amazing leap into Pell's equation, records an ingenious construction of rational
cyclic quadrilaterals and the formulae for their area and diagonals, gives a second-order
interpolation formula, not to mention the numerous contributions in astronomy.
His approach and attitude to mathematical discoveries reveal the mind and
spirit of a great pure mathematician enthusiastically engaged in vigorous pursuit 
of new results inspired by a feel for their intrinsic mathematical worth irrespec- 
tive of the requirements of immediate applications. His mathematical foresight can 
be seen in his realisation of the importance of “algebra” as a distinct branch of 
mathematics. 

Just as Kalidasa is the great representative poet of the Classical Age, Brahmagupta
is its great representative mathematician; the bhāvanā shining as a crest-jewel
among his mathematical gems.
9. Varga-prakrti: Motivation and Allied Issues


Brahmagupta's possible motivation for investigating the difficult equation $Dx^2 + 1 = y^2$
in the early seventh century was discussed in chapter 7 of our earlier paper
([Du1, p 101-104]). In this section we shall examine a few passages from the book
[Vn] of van der Waerden⁶⁴, which represent an approach to the Indian history on
Pell's equation that is often made by scholars. van der Waerden asks ([Vn, p 148]):


Brahmagupta was mainly an astronomer. .. . But why should he be interested in the solution 
of Pell’s equation? 


The answer can be found in Brahmagupta's text itself. As mentioned earlier (section
2), towards the end of chapter 18, Brahmagupta gives sukhamātram as the raison
d'être for the algebra problems discussed in the chapter:


These questions are stated simply for delight. 


We mention, in this context, that in a lecture on as broad a topic as Mathematics as
a basic science, Michael Atiyah⁶⁵ observed⁶⁶:


Number Theory for its own sake, as a great intellectual challenge, has a long history, particu- 
larly here in India. Already in the 7th century, Brahmagupta made important contributions to 
what is now known (incorrectly) as Pell's Equation. 


As hinted in section 8, Brahmagupta belonged to a phase of Indian history where 
one sees an insatiable curiosity and an overflowing joy of creation supported by a 
vigorous intellectual originality. As Sri Aurobindo observes ([A, p 401]):


When we look at the past of India, what strikes us next is her stupendous vitality, her 
inexhaustible power of life and joy of life, her almost unimaginably prolific creativeness. 


64 B. L. van der Waerden (1903-96) was a leading Dutch mathematician of the 20th century. The
world of mathematics is indebted to him for his influential two-volume treatise Moderne Algebra (1930) 
which became the model for the groups-rings-fields approach of text books on abstract algebra. This 
book, based on the lectures of Emmy Noether and Emil Artin, made a systematic presentation of a large 
body of research by Richard Dedekind, David Hilbert, Emmy Noether and Emil Artin. van der Waerden 
is recognised as having contributed most substantially to the promulgation of the ideas of Emmy Noether,
the founder of modern "Abstract Algebra". He also did important research in algebraic geometry and 
various other branches of mathematics. He was also a prominent researcher in history of astronomy and 
mathematics. Apart from his long-time interest in Greek mathematics, he had developed a deep interest 
in Babylonian astronomy and mathematics; his book Science Awakening (1954) contains his thoughts 
on the possible flow and modification of ideas from Babylonians to Pythagoreans. 

65 One of the greatest mathematicians of our time, Michael Atiyah (b. 1929) was awarded the Fields
Medal in 1966 and the Abel Prize in 2004, the two highest awards in mathematics. 

66 See Current Science, Dec. 1993, p 913.



Brahmagupta's delight in performing new feats in algebra was in tune with
the general creative impulse in the cultural atmosphere. After bringing clarity and 
refinement in Aryabhata's kuttaka, the pure mathematician's impulse drove him 
to take up the harder problem of the Pellian equation. Note that his penchant 
for number-theoretic problems also finds expression in his chapter on arithmetic 
where he blends his geometric and algebraic skill to give an ingenious construction 
of a cyclic quadrilateral whose sides, diagonals, circum-radius and area are all 
rational and whose diagonals are perpendicular to each other (and other related 
problems). While stating concrete examples, especially those involving the varga- 
prakrti, Brahmagupta often used the phrase kurvannavatsarad ganakah — One 
who can solve it within a year (is truly a) mathematician. Clearly he revelled in the 
challenge posed by the varga-prakrti, the determination of a solution of which was 
described by Jayadeva ([Shu1, p 14-15]) as being as difficult as setting a fly against
the wind. 

van der Waerden overlooks all such internal evidence. He writes ([Vn, p 148]):


The Greeks had a motive for occupying themselves with Pell's equation. The Pythagoreans
wanted to find rational approximations for what we call $\sqrt{2}$, and they found the “Side- and
Diagonal-Numbers”. Archimedes needed approximations for $\sqrt{3}$, and he used for this purpose
solutions of the equations $x^2 = 3y^2 + 1$ and $x^2 = 3y^2 - 2$.


Missing from van der Waerden's discussion is the fact that, right from the Vedic
times, there have been efforts by Indian mathematicians to obtain convenient, reasonably
accurate, rational approximations to irrational surds $\sqrt{D}$. The approximation
$1 + \frac{1}{3} + \frac{1}{3 \cdot 4} - \frac{1}{3 \cdot 4 \cdot 34}$ $(= \frac{577}{408})$ for $\sqrt{2}$ was known in the Vedic era and
recorded by three authors of the Sulba-sutras: Baudhayana, Apastamba and
Katyayana.⁶⁷ As we mentioned in section 1, the denominator and numerator of
the fraction $\frac{577}{408}$ satisfy the Pell equation $2x^2 + 1 = y^2$.⁶⁸ A few examples of
rational approximations occurring in post-Vedic texts were mentioned in ([Du1,
p 102-103]). As mentioned in section 7, the algebraist Narayana explicitly used
large solutions of $Dx^2 + 1 = y^2$ for approximating $\sqrt{D}$. None of these facts find
mention in van der Waerden's book published in 1983. van der Waerden continues


([Vn, p 149; 154]): 


We also have seen that the methods for obtaining these solutions are closely connected with 
the Euclidean algorithm, applied to irrational ratios of line segments. ... But why should 
Brahmagupta explain these methods, stripped of their original context, in an astronomical 


67 In as early a text as that of Dickson ([Di, p 341]), the Sulba fractions giving approximations to $\sqrt{2}$
are mentioned at the beginning of the chapter “Pell equation”.

68 Also recall (section 7) that the pair (408, 577) is related to the minimal solution (2, 3) by Brahmagupta's
identity $(2, 3; 1) \odot (2, 3; 1) \odot (2, 3; 1) \odot (2, 3; 1) = (408, 577; 1)$.
treatise? I can find only one explanation: he followed an earlier tradition ultimately derived
from Greek sources. 


I suppose that the Greeks were able to solve Pell’s equation, not only for D = 2 and D = 3, 
but also for higher values of D, by a systematic application of the Euclidean algorithm. I also 
suppose that their methods of calculation were copied, without proofs, in Hindu treatises like 
Brahmagupta’s Siddhanta. 


Now this is the analysis of a scholar as erudite and meticulous as van der Waerden. 
What goes wrong? This is not an isolated instance. P. Tannery asserted in 1882 that 
Indians must have got the solution of Pell’s equation from Greeks (who must have 
solved it before), several historians repeated this assertion in some form or the other, 
and, a hundred years after Tannery, the writings of van der Waerden (1983) convey a 
similar impression.⁶⁹ Perhaps, as hinted in section 8, consciously or subconsciously,
a topic like Pell’s equation was considered to be too sophisticated for any ancient 
mathematician hailing from, what was perceived to be, an other-worldly culture. In 
this connection, we make a few remarks. 


(i) Like Archimedes before him, or Newton and Gauss after him, Brahmagupta 
was a veritable colossus of his time. This perception is yet to sink in, in the circle 
of history of mathematics. Anyone familiar with his mathematical gifts would not 
need to postulate a preceding hypothetical text from where he could have "copied" 
(“without proof”!) Theorem 1. It is a reflection of the opulent intellectuality and 
the inexhaustible vital creativeness of the Classical Age that Brahmagupta and 
some of his successors were doing prolific research in astronomy as well as in pure 
mathematics with equal vigour and enthusiasm. 


(ii) The Indian achievements on Pell’s equation tend to overshadow the fact that 
ancient Indian algebraists had produced a large bulk of work involving ingenious 
solutions of various types of indeterminate equations (see [DS2] for some examples). 
Most of these equations were clearly investigated for their own sake. While this 
zest for exploration of higher degree indeterminate equations (after solving the 
application-oriented linear case) fits into the general spirit of the era, it was also 
part of a process of infusing a strong algebra-culture in mathematical thinking. 
We have seen in section 2 that astronomer-mathematicians like Brahmagupta and 
Bhaskara II, and mathematicians like Narayana, who pursued or discussed the 
equation $Dx^2 + 1 = y^2$, explicitly emphasised the importance of algebra in their
respective texts. Their statements indicate their passion for algebra as also the 
perception that the cultivation of the subject was a means for sharpening and 
enriching the intellect. 


69 In ([Sr2, p 17-18]), R. Sridharan quotes similar statements from a paper of van der Waerden as an
example of the prejudice of some mathematicians. 
Just as Fermat made use of Pell’s equation to create interest in number theory,
Brahmagupta and Bhaskara II too made use of challenging problems in indeter- 
minate equations (among other topics) to promote algebra among students with 
promising intellects. 

(iii) In the context of the last sentence quoted above from van der Waerden, 
we note that proof (or its absence) can be an issue in the context of cakravāla,
not bhāvanā. Brahmagupta's section Varga-prakrti consists of Theorem 1 which
is, in essence, an algebraic identity, and its repeated applications. All these results, 
once stated, can be verified by routine algebraic operations; and, as explained at 
the beginning of section 5, routine details are not to be expected in a terse (but still 
voluminous) original ancient Indian treatise. Readers can judge the validity of the 
quoted speculation by seeing the phrase about some Greek methods being copied 
without proofs in Brahmagupta's treatise in the light of the actual contents of the 
treatise. 

(iv) We have seen that Indian history on Pell's equation shows a gradual devel- 
opment: first there is kuttaka; then the composition law, rational solution and partial 
integer solution to Pell's equation; and then the complete integer solution (involving 
all preceding ideas). It is generally agreed that Greek mathematics had stagnated 
by the 5th-6th century CE; that is before Brahmagupta. If the Greeks had solved 
the problem before Brahmagupta, and the solution had reached India, why does 
Brahmagupta record (or “copy”) only a partial solution? Or should one presume
that, by some combination of circumstances, only an initial part of the hypothetical 
Greek treatment on Pell's equation had reached Brahmagupta, and a subsequent 
part reached Jayadeva after a few centuries? 

We mention here that it is only in certain aspects of astrology and astronomy that 
historians have traced a possible transmission of ideas from Greek (or Hellenistic) 
schools to the Indian treatises. There is no influence of the “pure mathematics" of 
the Greeks in the developments in post-Vedic Indian mathematics.⁷⁰ Now if there has
not been transmission of something as standard as Greek geometry, what were the 
chances of transmission of a topic as esoteric (for the times) as the integer solutions 
of the Pellian equation from the hypothetical Greek sources (and that too in at least 
two phases — one for Brahmagupta, one for Jayadeva)? 


On the one hand, there is concrete textual evidence of research on Pell's equation 
progressing in phases in ancient India in an environment with a good system of 
enumeration and a clarity regarding formation and handling of equations, with 


70 Indians had a strong culture of systematisation of various branches of knowledge. Mathematicians
belonging to an intellectual environment that was used to the structure of Panini’s Aṣṭādhyāyī would
have found the postulational style of Greek geometry suited to their temperament, had they got a glimpse 
of it. Someone like Brahmagupta, who could come up with gems on the cyclic quadrilateral, would have 
much more geometry in his treatise if only Greek geometry had reached him. The efforts of Mahavira 
show that Indians did have curiosity regarding conics. 
a love for large numbers and a love for algebra (cf. (vi) and (vii) of section 7).
On the other, there is as yet no concrete evidence of any Greek mathematician 
even investigating integer solutions of Pell’s equation for large D and no serious 
evidence of transmission to India of mathematical ideas that were not intimately 
connected to astronomy. And yet, there has been a tendency to deny the originality 
of Brahmagupta and his successors regarding their work on varga-prakrti, or to 
label it, euphemistically, as “disputed”. As H.C. Williams recently put it ([Wi, p 
404]): 


Finally, there is the belief, apparently due to Tannery, that the cyclic method derives from 
Greek influences. There seems, apart from possible wishful thinking on the part of Tannery, 
to be little solid evidence in support of this. The simple fact is that, as mentioned earlier, 
we don't really know what the Greeks knew about the Pell equation. What we do know, 
however, is that the Indian methods display a history of steady development and refinement 
up to and including the discovery of the cyclic method, and this very strongly suggests 
that Hankel's position that the Indians evolved the technique by themselves is the correct 
one. 


It is to be hoped that the seminar series on History of Mathematics at the Chennai
Mathematical Institute will be a precursor to more such congregations in India of 
leading mathematicians and historians of mathematics, and that such interactions 
will pave the way for a more accurate, more comprehensive, more representative 
and more insightful account on history of mathematics in the near future. 


Appendix: The Original Verses 


To save space, we refrain from quoting original verses for all the statements from 
ancient Indian treatises referred to in our article. We quote only the earliest extant 
verses which describe the three principles kuttaka, bhāvanā and cakravāla. English
translations for the quoted verses can be found in [ShuS, p 74-84] and [Ke, p 128] (for
kuttaka), [Du1, p 107-112] (for bhāvanā) and [Shu1, p 10-11] (for cakravala). As
mentioned at the beginning of section 5, most of these verses are very terse. Faithful 
translations in English usually require substantial supplementary explanatory notes, 
apart from a liberal use of brackets for the translations to make sense. Among the 
famous ancient Indian treatises, perhaps the most lucid exposition of the three 
principles occurs in the Bijaganita of Bhaskara II. 


Aryabhata's verses on Kuttaka 
adhikāgra bhāgahāraṃ chindyādūnāgra bhāgahāreṇa
śeṣaparaspara bhaktaṃ matiguṇamagrāntare kṣiptam


adha upari guṇitamantyayugūnāgra ccheda bhājite śeṣam
adhikāgra ccheda guṇaṃ dvi cchedāgram adhikāgrayutam
Brahmagupta's verses on Bhavana


mūlaṃ dvidheṣṭavargād guṇakaguṇādiṣṭa yutavihīnācca
ādyavadho guṇakaguṇaḥ sahāntyaghātena kṛtamantyam
vajravadhaikyaṃ prathamaṃ prakṣepaḥ kṣepavadhatulyaḥ


Jayadeva's verses on Cakravala 


hrasvajyesthaksepan pratirasya ksepabhaktayoh ksepat 
kuttākāre ca krte kiyadgunam ksepakarn ksiptva
tüvatkrteh prakrtyá hine praksepakena sarhbhakte 
svalpataravaptih syádityákalito'parah ksepah 
praksiptapraksepakakuttakare kanisthamülahate 
sajyesthapade praksep(ak)ena labdharn kanisthapadam 
ksiptaksepakakuttagunitáüttasmatkanisthamülahatam 
páscátyarn praksepam visodhya S$esarh mahanmülam 
kuryat kuttàkárarn punaranayoh ksepabhaktayoh padayoh 
tatsestahataksepe sadrSagune’smin prakrtihine 
praksepah ksepápte praksiptaksepakácca gunakarat 
alpaghnat sajyesthat ksepavaptam kanisthapadam 
etatksiptaksepakakuttakaghatadanantaraksepam 
hitvà'Ipahatarn Sesam jyestham tebhyasca gunakadi 
kuryáttavadyávat sannámekadvicaturnam patati 

iti cakravalakarane’vasarapraptani yojyani 
Development of Calculus in India


K. Ramasubramanian and M. D. Srinivas 


In this article we shall present an overview of the development of calculus in the Indian
mathematical tradition. The article is divided naturally into two parts. In the first part we
shall discuss the developments during what may be called the classical period, starting with
the work of Aryabhata (c. 499 CE) and extending up to the work of Narayana Pandita (c. 1350).
The work of the Kerala School starting with Madhava of Sangamagrama (c. 1350), which has a
more direct bearing on calculus, will be dealt with in the second part. Here we shall discuss
some of the contributions of the Kerala School during the period 1350-1500 as outlined in the
seminal Malayalam work Yuktibhasa of Jyesthadeva (c. 1530).


PART I : THE CLASSICAL PERIOD 


Aryabhata to Narayana Pandita (c. 500-1350 CE) 


1. Introduction 


In his pioneering history of calculus written sixty years ago, Carl Boyer was
totally dismissive of the Indian contributions to the conceptual development of the
subject.¹ Boyer's historical overview was written around the same time when (i)
Ramavarma Maru Thampuran and Akhileswarayyar brought out the first edition
of the Mathematics part of the seminal text Ganita-yukti-bhasa, and (ii) C.T.
Rajagopal and his collaborators, in a series of pioneering studies, drew attention to
the significance of the results and techniques outlined in Yuktibhasa (and the work
of the Kerala School of Mathematics in general), which seem to have been forgotten
after the initial notice by Charles Whish in the early nineteenth century. These and
the subsequent studies have led to a somewhat different perception of the Indian
contribution to the development of calculus, as may be gleaned from the following
quotation from a recent work on the history of mathematics:²


1 C. B. Boyer, The History of the Calculus and its Conceptual Development, Dover, New York
1949, pp. 61-62.

2 L. H. Hodgkin, A History of Mathematics: From Mesopotamia to Modernity, Oxford 2005,
p. 168.

We have here a prime example of two traditions whose aims were completely
different. The Euclidean ideology of proof which was so influential in the
Islamic world had no apparent influence in India (as al-Biruni had complained long
before), even if there is a possibility that the Greek tables of 'trigonometric
functions' had been transmitted and refined. To suppose that some version of
'calculus' underlay the derivation of the series must be a matter of conjecture.

The single exception to this generalization is a long work, much admired in
Kerala, which was known as Yuktibhasa, by Jyesthadeva; this contains something
more like proofs, but again, given the different paradigm, we should be
cautious about assuming that they are meant to serve the same functions. Both
the authorship and date of this work are hard to establish exactly (the date
usually claimed is the sixteenth century), but it does give explanations of how the
formulae are arrived at which could be taken as a version of the calculus.


The Malayalam work Ganita-yukti-bhasa (c. 1530) of Jyesthadeva indeed
presents an overview of the work of the Kerala School of mathematicians during
the period 1350-1500 CE. The Kerala School was founded by Madhava
(c. 1340-1420), who was followed by the illustrious mathematician-astronomers
Paramesvara (c. 1380-1460), his son Damodara and the latter's student
Nilakantha Somayaji (c. 1444-1550). While the achievements of the Kerala
School are indeed spectacular, it has now been generally recognised that these are
in fact very much in continuation with the earlier work of Indian mathematicians,
especially of the Aryabhatan school, during the period 500-1350 CE.

In the first part of this article, we shall consider some of the ideas and meth- 
ods developed in Indian mathematics, during the period 500-1350, which have a 
bearing on the later work of the Kerala School. In particular, we shall focus on the 
following topics: Mathematics of zero and infinity; iterative approximations for 
irrational numbers; summation (and repeated summations) of powers of natural 
numbers; use of second-order differences and interpolation in the calculation of 
jya or Rsines; the emergence of the notion of instantaneous velocity of a planet in 
astronomy; and the calculation of the surface area and volume of a sphere. 


2. Zero and Infinity 
2.1. Background 


The santi-mantra of Isavasyopanisad (of Sukla-yajurveda), a text of
Brahmavidya, refers to the ultimate absolute reality, the Brahman, as purna, the
perfect, complete or full. Talking of how the universe emanates from the Brahman,
it states:


पूर्णमदः पूर्णमिदं पूर्णात् पूर्णमुदच्यते ।
पूर्णस्य पूर्णमादाय पूर्णमेवावशिष्यते ॥


That (Brahman) is purna; this (the universe) is purna; [this] purna emanates
from [that] purna; even when purna is drawn out of purna, what remains is
also purna.


Panini's Astadhyayi (c. 500 BCE) has the notion of lopa which functions as a
null-morpheme. Lopa appears in seven sutras of Chapters 1, 3, 7, starting with


अदर्शनं लोपः (1.1.60).


Sunya appears as a symbol in Pingala's Chandah-sutra (c. 300 BCE). In Chapter
VIII, while enunciating an algorithm for evaluating any positive integral power
of 2 in terms of an optimal number of squaring and multiplication (duplication)
operations, sunya is used as a marker:


रूपे शून्यम् । द्विः शून्ये (8.29-30).


Different schools of Indian philosophy have related notions such as the notion of
abhava in the Nyaya School, and the sunyavada of the Bauddhas.


2.2. Mathematics of zero in Brahmasphuta-siddhanta (c. 628 CE) of
Brahmagupta


The Brahmasphuta-siddhanta (c. 628 CE) of Brahmagupta seems to be the first
available text that discusses the mathematics of zero. Sunya-parikarma or the
six operations with zero are discussed in chapter XVIII on algebra
(kuttakadhyaya), in the same six verses in which the six operations with positives and
negatives (dhanarna-sadvidha) are also discussed. Zero divided by zero is stated
to be zero. Any other quantity divided by zero is said to be taccheda (that with
zero-denominator):³

[Sanskrit verses in Devanagari; illegible in the scanned source.]


3 Brahmasphutasiddhanta of Brahmagupta, Ed. with his own commentary by Sudhakara
Dvivedi, Benaras 1902, verses 18.30-35, pp. 309-310.

... [The sum of] positive (dhana) and negative (rna), if they are equal, is zero
(kham). The sum of a negative and zero is negative, of a positive and zero is
positive and of two zeros, zero (sunya).

... Negative subtracted from zero is positive, and positive from zero is negative.
Zero subtracted from negative is negative, from positive is positive, and from zero
is zero (akasa). ... The product of zero and a negative, of zero and a positive, or
of two zeros is zero.

... A zero divided by zero is zero. ... A positive or a negative divided by zero is
that with zero-denominator.


2.3. Bhaskaracarya on Khahara 


Bhaskaracarya II (c. 1150), while discussing the mathematics of zero in
Bijaganita, explains that infinity (ananta-rasi) which results when some number is
divided by zero is called khahara. He also mentions the characteristic property of
infinity that it is unaltered even if 'many' are added to or taken away from it, in
terms similar to the invocatory verse of Isavasyopanisad mentioned above:⁴


[Sanskrit verses in Devanagari; illegible in the scanned source.]


A quantity divided by zero will be (called) khahara (an entity with zero as
divisor). Tell me ... three divided by zero ... This infinite (ananta or that without
end) quantity 3/0 is called khahara.

In this quantity, khahara, there is no alteration even if many are added or taken
out, just as there is no alteration in the Infinite (ananta), Infallible (acyuta)
[Brahman] even though many groups of beings enter in or emanate from [It] at
times of dissolution and creation.


2.4. Bhaskaracarya on multiplication and division by zero 


Bhaskaracarya, while discussing the mathematics of zero in Lilavati, notes that
when further operations are contemplated, the quantity being multiplied by zero
should not be changed to zero, but kept as is. Further he states that when the
quantity which is multiplied by zero is also divided by zero, then it remains unchanged.
He follows this up with an example and declares that this kind of calculation has
great relevance in astronomy:⁵


[Sanskrit verses in Devanagari; illegible in the scanned source.]


4 Bijaganita of Bhaskaracarya, Ed. by Muralidhara Jha, Benaras 1927, Vasana on
Khasadvidham 3, p. 6.


... A quantity multiplied by zero is zero. But it must be retained as such when
further operations [involving zero] are contemplated. When zero is the multiplier
of a quantity, if zero also happens to be a divisor, then that quantity remains
unaltered ...

... What is the number which when multiplied by zero, being added to half of
itself multiplied by 3 and divided by zero, amounts to sixty-three?

... Either following the inverse process or by choosing a desired number for the
unknown ('rule of false position'), the quantity is obtained to be 14. This kind of
calculation is of great use in mathematical astronomy.


Bhaskara works out his example as follows:


\[ \frac{0 \times \left[\left(x + \frac{x}{2}\right) \times 3\right]}{0} = 63. \qquad \text{Therefore,} \quad x = 14. \tag{1} \]


Bhaskara, it seems, had not fully mastered this kind of "calculation with
infinitesimals" as is clear from the following example that he presents in Bijaganita while
solving quadratic equations by eliminating the middle term:⁶


[Sanskrit verse in Devanagari; illegible in the scanned source.]


5 Lilavati of Bhaskaracarya, Ed. by H. C. Bannerjee, Calcutta 1927, Vasana on verses 45-46,
pp. 14-15.
6 Bijaganita, cited above, Vasana on avyaktavargadi-samikaranam 5, pp. 63-64.


Say what is the number which when added to half of itself, multiplied by zero,
squared and the square being augmented by twice its root and divided by zero,
becomes fifteen?


Clearly the problem as stated is


\[ \frac{\left[0\left(x + \frac{x}{2}\right)\right]^2 + 2 \times \left[0\left(x + \frac{x}{2}\right)\right]}{0} = 15. \tag{2} \]


Bhaskara in his Vasana seems to just cancel out the zeros without paying any
heed to the different powers of zero involved. He converts the problem into the
equation


\[ \left[x + \frac{x}{2}\right]^2 + 2 \times \left[x + \frac{x}{2}\right] = 15. \tag{3} \]


Solving this, by the method of elimination of the middle term, Bhaskara obtains
the solution x = 2. The other solution (x = -10/3) is not noted.
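
Both roots of equation (3) can be written out in a few lines. The following worked step is a sketch added for illustration; the substitution u is our own shorthand and does not appear in Bhaskara's text, and completing the square is used here as the modern counterpart of eliminating the middle term.

```latex
\begin{align*}
u &= x + \frac{x}{2} = \frac{3x}{2}, \\
u^2 + 2u = 15 \;&\Longrightarrow\; (u + 1)^2 = 16 \;\Longrightarrow\; u = 3 \ \text{or}\ u = -5, \\
x &= \frac{2u}{3} = 2 \ \text{or}\ -\frac{10}{3}.
\end{align*}
```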


3. Irrationals and iterative approximations 


3.1. Background 


Baudhāyana-śulva-sūtra gives the following approximation for √2:⁷


[Sanskrit verse in Devanagari; illegible in the scanned source.]


The measure [of the side] is to be increased by its third and this [third] again by
its own fourth less the thirty-fourth part [of the fourth]. That is the approximate
diagonal (savisesa).


\[ \sqrt{2} \approx 1 + \frac{1}{3} + \frac{1}{3 \cdot 4} - \frac{1}{3 \cdot 4 \cdot 34} = \frac{577}{408} = 1.4142156. \tag{4} \]
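
The arithmetic in (4) can be checked with exact rational numbers. The sketch below is our own illustration in Python (the variable names are hypothetical and not part of the source); it evaluates the rule for a side of unit length and compares the result with the floating-point value of √2.

```python
from fractions import Fraction
import math

# Baudhayana's rule for a unit side:
# side + side/3 + side/(3*4) - side/(3*4*34)
approx = 1 + Fraction(1, 3) + Fraction(1, 3 * 4) - Fraction(1, 3 * 4 * 34)

# Absolute deviation from the floating-point square root of 2.
error = abs(float(approx) - math.sqrt(2))

print(approx)          # exact rational value produced by the rule
print(float(approx))   # its decimal expansion
print(error)           # size of the deviation
```

Working with `Fraction` keeps the sum exact, so the only rounding enters in the final comparison with `math.sqrt`.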


7 Baudhayanasulvasutram (1.61-2), in The Sulvasutras, Ed. by S. N. Sen and A. K. Bag, New
Delhi 1983, p. 19.

The above approximation is accurate to 5 decimal places. Baudhayana-sulva-sutra,
in the context of discussing the problem of circling a square, also gives
an approximation for π:⁸


[Sanskrit verse in Devanagari; illegible in the scanned source.]

If it is desired to transform a square into a circle, [a cord of length] half the
diagonal of the square is stretched from the centre to the east; with one-third [of
the part lying outside] added to the remainder [of the half-diagonal] the [required]
circle is drawn.


If a is half the side of the square, then the radius r of the circle is given by

\[ r \approx \left(\frac{a}{3}\right)\left(2 + \sqrt{2}\right). \tag{5} \]


This corresponds to π ≈ 3.0883.
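
Equation (5) and the quoted value of π can be checked numerically. The sketch below assumes, as the phrase 'transform a square into a circle' suggests, that the circle is meant to have the same area as the square; the variable names are ours.

```python
import math

a = 1.0                                   # half the side of the square (arbitrary unit)
half_diagonal = a * math.sqrt(2)          # distance from the centre to a corner

# One-third of the part of the half-diagonal lying outside the square is added to a.
r = a + (half_diagonal - a) / 3

# Equating the area of the square, (2a)^2, with pi * r^2 gives the implied value of pi.
implied_pi = (2 * a) ** 2 / r ** 2

print(r, implied_pi)
```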


3.2. Algorithm for square-roots in Aryabhatiya 


The Aryabhatiya of Aryabhata (c. 499 CE) gives a general algorithm for
computing the successive digits of the square root of a number. The procedure, given
in the following verse, is elucidated by us via an example:⁹


[Sanskrit verse in Devanagari; illegible in the scanned source.]


Always divide the non-square (even) place by twice the square-root [already
found]. Having subtracted the square [of the quotient] from the square (odd)
place, the quotient gives the [digit in the] next place in the square-root.


[Worked example; illegible in the scanned source.]

3.3. Approximating the square-root of a non-square number 


The method for obtaining the approximate square-root (asanna-mula) of a non-square
number (amulada-rasi) is stated explicitly in Trisatika of Sridhara (c. 750):¹⁰


8 Baudhayanasulvasutram (1.58), ibid., p. 19.

9 Aryabhatiya of Aryabhata, Ed. by K. S. Shukla and K. V. Sarma, New Delhi 1976,
Ganitapada 4, p. 36.
10 Trisatika of Sridhara, Ed. by Sudhakara Dvivedi, Varanasi 1899, verse 46, p. 34.

[Sanskrit verse in Devanagari; illegible in the scanned source.]


Multiply the non-square number by some large square number, take the square- 
root [of the product] neglecting the remainder, and divide by the square-root of 
the multiplier. 
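
Sridhara's rule translates directly into integer arithmetic. In the sketch below the function name and the choice of multiplier (the square of 10000) are our own assumptions, made only to illustrate the rule.

```python
import math
from fractions import Fraction

def sridhara_sqrt(n, multiplier_root):
    """Approximate sqrt(n) as floor(sqrt(n * m^2)) / m, where m = multiplier_root."""
    scaled = n * multiplier_root ** 2               # multiply by a large square number
    # Integer square root drops the remainder; then divide by the root of the multiplier.
    return Fraction(math.isqrt(scaled), multiplier_root)

approx = sridhara_sqrt(2, 10_000)
print(approx, float(approx))
```

A larger multiplier gives more digits, since the neglected remainder affects the result by less than one part in the root of the multiplier.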


Narayana Pandita (c. 1356) has noted that the solutions of varga-prakrti (the
so-called Pell's equation) can be used to compute successive approximations to the
square-root of a non-square number:¹¹


[Sanskrit verse in Devanagari; illegible in the scanned source.]


[With the number] whose square-root is to be found as the prakrti and unity as
the ksepa, [obtain the greater and smaller] roots. The greater root divided by the
lesser root is an approximate value of the square-root.


Narayana considers the example

\[ 10x^2 + 1 = y^2, \tag{6} \]

and gives the approximate values

\[ \sqrt{10} \approx \frac{19}{6},\ \frac{721}{228},\ \frac{27379}{8658}, \tag{7} \]

which are obtained by successive compositions (bhavana) of the solution x = 6,
y = 19:¹²


228 = (2)(6)(19), 721 = (10)(6)² + (19)², and so on.
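
The three approximations in (7) can be regenerated by composing solutions of (6). The sketch below is our own illustration: it assumes that each new solution is obtained by composing the previous one with the basic solution x = 6, y = 19, which is the choice that reproduces the values quoted above; the function and variable names are hypothetical.

```python
from fractions import Fraction

D = 10
BASE = (6, 19)     # smaller and greater roots: D * 6**2 + 1 == 19**2

def compose(p, q):
    """Compose two solutions (x, y) of D*x^2 + 1 = y^2 into a third solution."""
    (x1, y1), (x2, y2) = p, q
    return (x1 * y2 + x2 * y1, D * x1 * x2 + y1 * y2)

solutions = [BASE]
for _ in range(2):
    solutions.append(compose(solutions[-1], BASE))

# The greater root divided by the lesser root approximates sqrt(D).
approximations = [Fraction(y, x) for x, y in solutions]
for value in approximations:
    print(value, float(value))
```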


3.4. Approximate value of π in Aryabhatiya


Aryabhata (c. 499) gives the following approximate value for π:¹³


चतुरधिकं शतमष्टगुणं द्वाषष्टिस्तथा सहस्राणाम् ।
अयुतद्वयविष्कम्भस्यासन्नो वृत्तपरिणाहः ॥


One hundred plus four multiplied by eight and added to sixty-two thousand: This
is the approximate measure of the circumference of a circle whose diameter is
twenty-thousand.


Thus as per the above verse π ≈ 62832/20000 = 3.1416.


11 Ganitakaumudi of Narayana Pandita, Ed. by Padmakara Dvivedi, Part II, Benaras 1942,
verse 10.17, p. 244.

12 Bhavana or the rule of composition enunciated by Brahmagupta is the transformation (X, Y) →
(X² + DY², 2XY) which transforms a solution x = X, y = Y of the equation x² − Dy² = 1 into
another solution with larger values for x, y, which correspond to higher convergents in the continued
fraction expansion of √D and thus give better approximations to it.

13 Aryabhatiya, cited above, Ganitapada 10, p. 45.


3.5. Successive doubling of the sides of the circumscribing polygon 


It appears that Indian mathematicians (at least in the Aryabhatan tradition)
employed the method of successive doubling of the sides of a circumscribing
polygon (starting from the circumscribing square, leading to an octagon, etc.)
to find successive approximations to the circumference of a circle. This method
has been described in the later Kerala texts Yuktibhasa (c. 1530) of Jyesthadeva
and the Kriyakramakari commentary (c. 1535) of Sankara Variyar on Lilavati of
Bhaskara II. The latter cites the verses of Madhava (c. 1340-1420) in this
connection and notes at the end that:¹⁴

[Sanskrit verse in Devanagari; illegible in the scanned source.]

Thus, one can obtain [an approximation to the circumference of the circle] to any
desired level of accuracy.


FIGURE 1. Finding the circumference of a circle from circumscribing
polygons.


14 Līlāvatī of Bhaskara II, Ed. with commentary Kriyakramakari of Sankara Variyar by
K. V. Sarma, Hoshiarpur 1975, comm. on verse 199, p. 379.

We now outline this method as described in Yuktibhasa.¹⁵ In Figure 1, $EOSA_1$
is the first quadrant of the square circumscribing the given circle. $EA_1$ is half the
side of the circumscribing square. Let $OA_1$ meet the circle at $C_1$. Draw $A_2C_1B_2$
parallel to $ES$. $EA_2$ is half the side of the circumscribing octagon.

Similarly, let $OA_2$ meet the circle at $C_2$. Draw $A_3C_2B_3$ parallel to $EC_1$. $EA_3$
is now half the side of a circumscribing regular polygon of 16 sides. And so on.
Let half the sides of the circumscribing square, octagon etc., be denoted

\[ l_1 = EA_1, \quad l_2 = EA_2, \quad l_3 = EA_3, \ \ldots \tag{8} \]

The corresponding karnas (diagonals) are

\[ k_1 = OA_1, \quad k_2 = OA_2, \quad k_3 = OA_3, \ \ldots \tag{9} \]

And the abadhas (intercepts) are

\[ a_1 = D_1A_1, \quad a_2 = D_2A_2, \quad a_3 = D_3A_3, \ \ldots \tag{10} \]

Now

\[ l_1 = r, \quad k_1 = \sqrt{2}\, r \quad \text{and} \quad a_1 = \frac{r}{\sqrt{2}}. \tag{11} \]

Using the bhuja-koti-karna-nyaya (Pythagoras theorem) and trairasika-nyaya
(rule of three for similar triangles), it can be shown that

\[ l_2 = l_1 - (k_1 - r)\,\frac{l_1}{a_1}, \tag{12} \]

\[ k_2^2 = r^2 + l_2^2, \tag{13} \]

\[ \text{and} \quad a_2 = \frac{k_2^2 - (r^2 - l_2^2)}{2 k_2}. \tag{14} \]

In the same way $l_{n+1}$, $k_{n+1}$ and $a_{n+1}$ are to be obtained in terms of $l_n$, $k_n$ and
$a_n$. These can be shown to be equivalent to the recursion relation:¹⁶

\[ l_{n+1} = \frac{r}{l_n}\left[\sqrt{r^2 + l_n^2} - r\right]. \tag{15} \]


15 Ganita-yukti-bhasa of Jyesthadeva, Ed. and Tr. by K. V. Sarma, with Exp. Notes by
K. Ramasubramanian, M. D. Srinivas and M. S. Sriram, 2 Vols, Hindustan Book Agency, New Delhi 2008.
Reprint Springer 2009, Vol. I Section 6.2, pp. 46-49, 180-83, 366-69.

16 If we set r = 1 and $l_n = \tan\theta_n$, then equation (15) gives $l_{n+1} = \tan(\theta_n/2)$. Actually, $\theta_n = \pi/2^{n+1}$
and the above method is based on the fact that for large n, $2^n \tan(\pi/2^{n+1}) \approx 2^n (\pi/2^{n+1}) = \pi/2$.
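
The recursion (15) is easy to iterate numerically. The sketch below is our own illustration (function and variable names are hypothetical): starting from the circumscribing square it doubles the number of sides at each step and records the ratio of the perimeter of the polygon to the diameter of the circle, which approaches π from above.

```python
import math

def pi_estimates(steps, r=1.0):
    """Return [(number of sides, perimeter / diameter), ...] using recursion (15)."""
    half_side = r          # l_1 = r for the circumscribing square
    sides = 4
    results = []
    for _ in range(steps):
        # Perimeter = sides * 2 * half_side, diameter = 2 * r.
        results.append((sides, sides * half_side / r))
        # Equation (15): half-side of the polygon with twice as many sides.
        half_side = (r / half_side) * (math.sqrt(r * r + half_side ** 2) - r)
        sides *= 2
    return results

estimates = pi_estimates(12)
for sides, value in estimates:
    print(sides, value)
```

In floating-point arithmetic the subtraction inside (15) loses accuracy when the half-side becomes very small, so this direct form is only suitable for a moderate number of doublings.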




4. Summation (and repeated summations) of powers of natural numbers 
(sankalita) 


4.1. Sum of squares and cubes of natural numbers in Aryabhatiya

The ancient text Brhaddevata (c. 5th century BCE) has the result

\[ 2 + 3 + \ldots + 1000 = 500{,}499. \tag{16} \]


Aryabhata (c. 499 CE), in the Ganitapada of Aryabhatiya, deals with a general
arithmetic progression in verses 19-20. He gives the sum of the squares and cubes
of natural numbers in verse 22:¹⁷


[Sanskrit verse in Devanagari; illegible in the scanned source.]


The product of the three quantities, the number of terms plus one, the same
increased by the number of terms, and the number of terms, when divided by six,
gives the sum of squares of natural numbers (varga-citi-ghana). The square
of the sum of natural numbers gives the sum of the cubes of natural numbers
(ghana-citi-ghana).


In other words,

\[ 1^2 + 2^2 + 3^2 + \ldots + n^2 = \frac{n(n+1)(2n+1)}{6}, \tag{17} \]

\[ 1^3 + 2^3 + 3^3 + \ldots + n^3 = [1 + 2 + 3 + \ldots + n]^2 = \left[\frac{n(n+1)}{2}\right]^2. \tag{18} \]

4.2. Repeated sum of natural numbers in Aryabhatiya 


Aryabhata also gives the repeated sum of the sum of the natural numbers
(sankalita-sankalita or vara-sankalita):¹⁸

[Sanskrit verse in Devanagari; illegible in the scanned source.]

Of the series (upaciti) 1, 2, ..., n, take three terms in continuation of which the
first is the given number of terms (gaccha), and find their product; that [product],
or the number of terms plus one subtracted from its own cube, divided by six, gives
the repeated sum (citi-ghana).


17 Aryabhatiya, cited above, Ganitapada 22, p. 65.
18 Aryabhatiya, cited above, Ganitapada 21, p. 64.

We have

\[ 1 + 2 + 3 + \ldots + n = \frac{n(n+1)}{2}. \tag{19} \]


Aryabhata's result expresses the sum of these triangular numbers in two forms:

\[ \frac{1(1+1)}{2} + \frac{2(2+1)}{2} + \ldots + \frac{n(n+1)}{2} = \frac{n(n+1)(n+2)}{6} = \frac{(n+1)^3 - (n+1)}{6}. \tag{20} \]

4.3. Narayana Pandita's general formula for Varasankalita 


In his Ganita-kaumudi, Narayana Pandita (c. 1356) gives the formula for the
r-th order repeated sum of the sequence of numbers 1, 2, 3, ..., n:¹⁹


[Sanskrit verse in Devanagari; illegible in the scanned source.]


The pada (number of terms in the sequence) is the first term [of an arithmetic
progression] and 1 is the common difference. Take as numerators [the terms
in the AP] numbering one more than vara (the number of times the repeated
summation is to be made). The denominators are [terms of an AP of the same
length] starting with one and with common difference one. The resultant product
is vara-sankalita.


Let

\[ 1 + 2 + 3 + \ldots + n = \frac{n(n+1)}{2} = V_n^{(1)}. \tag{21} \]

Then, Narayana's result is

\[ V_n^{(r)} = V_1^{(r-1)} + V_2^{(r-1)} + \ldots + V_n^{(r-1)} \tag{22} \]

\[ \phantom{V_n^{(r)}} = \frac{n(n+1) \ldots (n+r)}{1 \cdot 2 \ldots (r+1)}. \tag{23} \]


Narayana's result can also be expressed in the form of a sum of polygonal
numbers:

\[ \sum_{m=1}^{n} \frac{m(m+1) \ldots (m+r-1)}{1 \cdot 2 \ldots r} = \frac{n(n+1) \ldots (n+r)}{1 \cdot 2 \ldots (r+1)}. \tag{24} \]


19 Ganitakaumudi of Narayana Pandita, Ed. by Padmakara Dvivedi, Part I, Benaras 1936, verse
3.19-20, p. 123.

This result can be used to evaluate the sums $\sum_{k=1}^{n} k^2$, $\sum_{k=1}^{n} k^3$, ... by induction.
It can also be used to estimate the behaviour of these sums for large n.
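
The equality of the repeated sum (22) and the closed form (23) can be tested directly. In the sketch below, which is our own illustration with hypothetical function names, the repeated sum is built by taking partial sums r times, and the closed form is evaluated as a binomial coefficient, since n(n+1)...(n+r)/(1·2...(r+1)) is the number of ways of choosing r+1 objects from n+r.

```python
from itertools import accumulate
from math import comb

def vara_sankalita(n, r):
    """r-th order repeated sum of 1, 2, ..., n, built by r rounds of partial sums."""
    terms = list(range(1, n + 1))          # the sequence 1, 2, ..., n
    for _ in range(r):
        terms = list(accumulate(terms))    # one further summation
    return terms[-1]

def narayana_formula(n, r):
    """Closed form (23), written as the binomial coefficient C(n + r, r + 1)."""
    return comb(n + r, r + 1)

print(vara_sankalita(4, 2), narayana_formula(4, 2))
```

For r = 1 this reduces to (19), and for r = 2 to Aryabhata's result (20).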


4.4. Summation of geometric series 


The geometric series 1 + 2 + 2² + ... + 2ⁿ is summed in Chapter VIII of Pingala's
Chandah-sutra (c. 300 BCE). As we mentioned earlier, Pingala also gives an
algorithm for evaluating any positive integral power of a number (2 in this context)
in terms of an optimal number of squaring and multiplication operations.
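
A procedure of this kind, built only from squarings and single multiplications, can be sketched as follows. The code is a modern restatement under our own assumptions: it halves an even exponent (followed by a squaring) and reduces an odd exponent by one (followed by a multiplication); it does not reproduce the marking scheme of the sutras, and the function name is hypothetical.

```python
def power_by_squaring(base, n):
    """Compute base**n for n >= 1 using only squarings and single multiplications."""
    if n < 1:
        raise ValueError("n must be a positive integer")
    if n == 1:
        return base
    if n % 2 == 0:
        half = power_by_squaring(base, n // 2)
        return half * half                          # even exponent: halve, then square
    return base * power_by_squaring(base, n - 1)    # odd exponent: reduce by one, then multiply

print(power_by_squaring(2, 10))
```

The number of operations grows with the number of binary digits of the exponent rather than with the exponent itself.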

Mahaviracarya (c. 850), in his Ganita-sara-sangraha, gives the sum of a
geometric series and also explains the Pingala algorithm for finding the required power
of the common ratio between the terms of the series:²⁰


[Sanskrit verses in Devanagari; illegible in the scanned source.]


The first term when multiplied by the product of the common ratio (guna) taken
as many times as the number of terms (pada), gives rise to the gunadhana.
This gunadhana, when diminished by the first term and divided by the common
ratio less one, is to be understood as the sum of the geometrical series
(guna-sankalita).


That is

\[ a + ar + ar^2 + \ldots + ar^{n-1} = \frac{a(r^n - 1)}{(r - 1)}. \tag{25} \]
Virasena (c. 816), in his commentary Dhavala on the Satkhandagama, has made
use of the sum of the following infinite geometric series in his evaluation of the
volume of the frustum of a right circular cone:²¹

\[ \frac{1}{4} + \left(\frac{1}{4}\right)^2 + \ldots + \left(\frac{1}{4}\right)^n + \ldots = \frac{1}{3}. \tag{26} \]

The proof of the above result is discussed in the Aryabhatiya-bhasya (c. 1502) 
of Nilakantha Somayaji. As we shall see later (section 10.1), Nilakantha makes 
use of this series for deriving an approximate expression for a small arc in terms 
of the corresponding chord in a circle. 


20 Ganitasarasangraha of Mahaviracarya, Ed. by Lakshmi Chanda Jain, Sholapur 1963,
verses 2.93-94, pp. 28-29.

21 See, for instance, T. A. Sarasvati Amma, Geometry in Ancient and Medieval India, Motilal
Banarsidass, Delhi 1979, Rep. 2007, pp. 203-05.


5. Use of Second-order differences and interpolation in computation of
Rsines (Jyanayana)


Jya, Koti and Sara


The jya or bhuja-jya of an arc of a circle is actually the half-chord (ardha-jya
or jyardha) of double the arc. In Figure 2, if R is the radius of the circle, jya
(Rsine), koti or koti-jya (Rcosine) and sara (Rversine) of the capa (arc) EC are
given by:

\[ \text{jya (arc } EC) = CD = R\sin(\angle COE) \tag{27} \]

\[ \text{koti (arc } EC) = OD = R\cos(\angle COE) \tag{28} \]

\[ \text{sara (arc } EC) = ED = R\,\mathrm{vers}(\angle COE) = R - R\cos(\angle COE). \tag{29} \]
FIGURE 2. Jya, Koti and Sara.


For computing standard Rsine-tables (pathita-jya), the circumference of a circle
is divided into 21600′ and usually the Rsines are tabulated for every multiple of
225′, thus giving 24 tabulated Rsines in a quadrant. Using the value of π ≈
62832/20000 = 3.1416, given by Aryabhata, the value of the radius then turns out to be
3437′ 44″ 19‴. This is accurate up to the seconds, but is usually approximated
to 3438′. Using a more accurate value of π, Madhava (c. 1340-1420) gave the
value of the radius correct to the thirds as 3437′ 44″ 48‴, which is also known by
the Katapayadi formula devo-visvasthali-bhrguh.


5.1. Computation of Rsines


Once the value of the radius R is fixed (in units of minutes, seconds etc.) the 24
Rsines can be computed (in the same units) using standard relations of jyotpatti
(trigonometry). For instance, Varahamihira has given the following Rsine values
and relations in his Pancasiddhantika (c. 505):²²

\[ R\sin(30^\circ) = \frac{R}{2} \tag{30a} \]

\[ R\sin(45^\circ) = \frac{R}{\sqrt{2}} \tag{30b} \]

\[ R\sin(60^\circ) = \frac{\sqrt{3}}{2}\,R \tag{30c} \]

\[ R\sin(90^\circ) = R \tag{30d} \]

\[ R\sin(A) = R\cos(90^\circ - A) \tag{31} \]

\[ R\sin^2(A) + R\cos^2(A) = R^2 \tag{32} \]

\[ R\sin\left(\frac{A}{2}\right) = \left(\frac{1}{2}\right)\left[R\sin^2(A) + R\,\mathrm{vers}^2(A)\right]^{\frac{1}{2}} = \left(\frac{R}{2}\right)^{\frac{1}{2}}\left[R - R\cos A\right]^{\frac{1}{2}}. \tag{33} \]


The above Rsine values (30) and relations (31)-(33) can be derived using the
bhuja-koti-karna-nyaya (Pythagoras theorem) and trairasika (rule of three for
similar triangles), as is done for instance in the Vasana-bhasya of
Prthudakasvamin (c. 860) on Brahmasphutasiddhanta (c. 628) of Brahmagupta.
Equations (30)-(33) can be used to compute all 24 tabular Rsine values.
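
Both the radius values quoted earlier in this section and the claim that (30)-(33) suffice for all 24 tabular Rsines can be checked numerically. The sketch below is our own illustration (all function and variable names are hypothetical): it converts the radius to minutes, seconds and thirds, and then starts from the four values (30a)-(30d) and repeatedly applies the complement relation (31)-(32) and the half-arc relation (33) until no new tabular arc is produced.

```python
import math

R = 21600 / (2 * math.pi)        # radius in minutes of arc

def to_sexagesimal(value, places=2):
    """Return (integer part, base-60 digits...) of value, rounded to `places` digits."""
    scaled = round(value * 60 ** places)
    digits = []
    for _ in range(places):
        scaled, digit = divmod(scaled, 60)
        digits.append(digit)
    return (scaled, *reversed(digits))

def rcos_from_rsin(rsin):
    """Relation (32); by (31) this is also the Rsine of the complementary arc."""
    return math.sqrt(max(R * R - rsin * rsin, 0.0))

def rsin_half(rsin):
    """Relation (33): Rsine of A/2 from the Rsine of A."""
    return math.sqrt((R / 2) * (R - rcos_from_rsin(rsin)))

# Index k stands for the arc k * 225' (k = 8, 12, 16, 24 are 30, 45, 60, 90 degrees).
table = {8: R / 2, 12: R / math.sqrt(2), 16: math.sqrt(3) / 2 * R, 24: R}
pending = list(table)
while pending:
    k = pending.pop()
    candidates = [(24 - k, rcos_from_rsin(table[k]))]        # complementary arc
    if k % 2 == 0:
        candidates.append((k // 2, rsin_half(table[k])))     # half arc
    for new_k, value in candidates:
        if new_k > 0 and new_k not in table:
            table[new_k] = value
            pending.append(new_k)

radius_aryabhata = to_sexagesimal(21600 / (2 * 3.1416))
radius_precise = to_sexagesimal(R)
print(radius_aryabhata, radius_precise, len(table))
```

The order in which the arcs are reached is an artefact of this sketch and is not taken from any of the texts cited.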


5.2. Aryabhata's computation of Rsine-differences 


The computation of tabular Rsine values was made much simpler by Aryabhata
who gave an ingenious method of computing the Rsine-differences, making use of
the important property that the second-order differences of Rsines are proportional
to the Rsines themselves:²³


22 Pancasiddhantika of Varahamihira, Ed. by T. S. Kuppanna Sastry and K. V. Sarma, Madras
1993, verses 4.1-5, pp. 76-80.
23 Aryabhatiya, cited above, Ganitapada 12, p. 51.

[Sanskrit verse in Devanagari; illegible in the scanned source.]

The first Rsine divided by itself and then diminished by the quotient will give
the second Rsine-difference. The same first Rsine, diminished by the quotients
obtained by dividing each of the preceding Rsines by the first Rsine, gives the
remaining Rsine-differences.


Let $B_1 = R\sin(225′)$, $B_2 = R\sin(450′)$, ..., $B_{24} = R\sin(90^\circ)$, be the
twenty-four Rsines, and let $\Delta_1 = B_1$, $\Delta_2 = B_2 - B_1$, ..., $\Delta_k = B_k - B_{k-1}$, ...
be the Rsine-differences. Then, the above rule may be expressed as²⁴

\[ \Delta_2 = B_1 - \frac{B_1}{B_1} \tag{34} \]

\[ \Delta_{k+1} = B_1 - \frac{B_1 + B_2 + \ldots + B_k}{B_1} \quad (k = 1, 2, \ldots, 23). \tag{35} \]

This second relation is also sometimes expressed in the equivalent form

\[ \Delta_{k+1} = \Delta_k - \frac{\Delta_1 + \Delta_2 + \ldots + \Delta_k}{B_1} \quad (k = 1, 2, \ldots, 23). \tag{36} \]


From the above it follows that

\[ \Delta_{k+1} - \Delta_k = \frac{-B_k}{B_1} \quad (k = 1, 2, \ldots, 23). \tag{37} \]


Since Aryabhata also takes $\Delta_1 = B_1 = R\sin(225′) \approx 225′$, the above relations
reduce to

\[ \Delta_1 = 225′ \tag{38} \]

\[ \Delta_{k+1} - \Delta_k = \frac{-B_k}{225′} \quad (k = 1, 2, \ldots, 23). \tag{39} \]

5.3. Derivation of the Aryabhata-relation for the second-order 
Rsine-differences 


Aryabhata's relation for the second-order Rsine-differences is derived and made
more exact in the Aryabhatiya-bhasya (c. 1502) of Nilakantha Somayaji and
Yuktibhasa (c. 1530) of Jyesthadeva. We shall present a detailed account of
the first and second-order Rsine-differences as given in Yuktibhasa²⁵ later in
Section 16. Here we shall only summarize the argument.

In Figure 3, the arcs $EC_j$ and $EC_{j+1}$ are successive multiples of 225′. The
Rsine and Rcosine of the arcs $EC_j$ and $EC_{j+1}$ are given by

\[ B_j = C_jP_j, \quad B_{j+1} = C_{j+1}P_{j+1} \tag{40} \]

\[ \text{and} \quad K_j = C_jT_j, \quad K_{j+1} = C_{j+1}T_{j+1}, \tag{41} \]

respectively. Let $M_{j+1}$ and $M_j$ be the mid-points of the arcs $C_jC_{j+1}$, $C_{j-1}C_j$
and the Rsine and Rcosine of the arcs $EM_j$ and $EM_{j+1}$ be denoted respectively
by $B_{j-\frac{1}{2}}$, $B_{j+\frac{1}{2}}$, $K_{j-\frac{1}{2}}$, $K_{j+\frac{1}{2}}$.


FIGURE 3. Derivation of Aryabhata relation.


24 Aryabhata is using the approximation $\Delta_1 - \Delta_2 \approx 1′$ and the second terms in the RHS of
(34)-(36) and the RHS of (37) and (39) have an implicit factor of $(\Delta_1 - \Delta_2)$. See (45) below which is
exact.


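25 Ganita-yukti-bhasa, cited above, Section 7.5.1, pp. 94-96, 221-24, 417-20.

Let the chord of the arc $C_jC_{j+1}$ be denoted by α and let R be the radius. Then
a simple argument based on trairasika (similar triangles) leads to the relations:²⁶

\[ B_{j+1} - B_j = \left(\frac{\alpha}{R}\right) K_{j+\frac{1}{2}}, \tag{42} \]

\[ K_{j+\frac{1}{2}} - K_{j-\frac{1}{2}} = -\left(\frac{\alpha}{R}\right) B_j. \tag{43} \]

Thus we get

\[ \Delta_{j+1} - \Delta_j = (B_{j+1} - B_j) - (B_j - B_{j-1}) = -\left(\frac{\alpha}{R}\right)^2 B_j. \tag{44} \]

We can also express this relation in the form

\[ \Delta_{j+1} - \Delta_j = -\frac{B_j(\Delta_1 - \Delta_2)}{B_1}. \tag{45} \]


The above relations are exact. Aryabhata's relation (39) corresponds to the
approximations $B_1 \approx 225′$ and $\Delta_1 - \Delta_2 \approx 1′$, so that

\[ \left(\frac{\alpha}{R}\right)^2 = \frac{\Delta_1 - \Delta_2}{B_1} \approx \frac{1}{225}. \tag{46} \]


In Tantrasangraha, Nilakantha Somayaji has given the finer approximation:²⁷

\[ \left(\frac{\alpha}{R}\right)^2 = \frac{\Delta_1 - \Delta_2}{B_1} \approx \frac{1}{233\frac{1}{2}}. \tag{47} \]


26 Equations (42) and (43) are essentially the relations:

\[ R\sin(x + h) - R\sin x = \left(\frac{\alpha}{R}\right) R\cos\left(x + \frac{h}{2}\right), \]

\[ R\cos\left(x + \frac{h}{2}\right) - R\cos\left(x - \frac{h}{2}\right) = -\left(\frac{\alpha}{R}\right) R\sin x, \]

with $\alpha = 2R\sin\frac{h}{2}$. These lead to (44) in the form:

\[ (R\sin(x + h) - R\sin x) - (R\sin x - R\sin(x - h)) = -\left(\frac{\alpha}{R}\right)^2 R\sin x. \]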


27 Tantrasangraha of Nilakantha Somayaji, Ed. with Laghu-vivrti of Sankara Variyar by
S. K. Pillai, Trivandrum 1958, verse 2.4, p. 17.

This is further refined by Sankara Variyar in his commentary Laghu-vivrti in the
form:²⁸

\[ \left(\frac{\alpha}{R}\right)^2 = \frac{\Delta_1 - \Delta_2}{B_1} \approx \frac{1}{233′\,32″}. \tag{48} \]


Since α = 2R sin 112′30″, we find that the above relation is correct up to seconds.
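
The exactness of (44) and the size of the divisor in (46)-(48) can be checked with modern sine values. The sketch below is our own illustration (names are hypothetical, and the radius is taken as 21600/2π minutes): it evaluates the second-order differences of the 24 tabular Rsines, compares them with the right-hand side of (44), and computes (R/α)².

```python
import math

R = 21600 / (2 * math.pi)            # radius in minutes of arc
h = math.radians(225 / 60)           # tabular interval of 225' expressed in radians
alpha = 2 * R * math.sin(h / 2)      # chord of the interval, 2R sin(112'30")

def rsine(j):
    """Rsine of the arc j * 225'."""
    return R * math.sin(j * h)

def second_difference(j):
    """(B_{j+1} - B_j) - (B_j - B_{j-1})."""
    return (rsine(j + 1) - rsine(j)) - (rsine(j) - rsine(j - 1))

# Largest deviation of the second differences from -(alpha/R)^2 * B_j, as in (44).
max_residual = max(
    abs(second_difference(j) + (alpha / R) ** 2 * rsine(j)) for j in range(1, 24)
)

divisor = (R / alpha) ** 2           # the reciprocal of (alpha/R)^2
print(max_residual, divisor)
```

The computed divisor lies between the values 233½ and 233′32″ used in (47) and (48), and well above the value 225 implied by (46).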
Commenting on Aryabhata's method of computing Rsines, Delambre had
remarked:²⁹

The method is curious: it indicates a method of calculating the table of sines by
means of their second-differences... This differential process has not up to now
been employed except by Briggs, who himself did not know that the constant
factor was the square of the chord ΔA (= 3°45′) or of the interval, and who could
not obtain it except by comparing the second differences obtained in a different
manner. The Indians also have probably done the same; they obtained the method
of differences only from a table calculated previously by a geometric process.
Here then is a method which the Indians possessed and which is found neither
amongst the Greeks, nor amongst the Arabs.


5.4. The Rsine-table of Aryabhata 


In the Gitika-pada of Aryabhatiya, Aryabhata has given a table of
Rsine-differences:³⁰


[Sanskrit verse in Devanagari; illegible in the scanned source.]

225, 224, 222, 219, 215, 210, 205, 199, 191, 183, 174, 164, 154, 143, 131, 119,
106, 93, 79, 65, 51, 37, 22, and 7: these are the Rsine-differences [at intervals of
225′ of arc] in terms of the minutes of arc.


The above values follow directly from Aryabhata's relation (39) for the
second-order Rsine-differences. To start with, $\Delta_1 = B_1 = R\sin(225′) \approx 225′$. Then we
get $\Delta_2 = B_1 - \frac{B_1}{B_1} = 224′$ and so on.

The Rsine-table of Aryabhata?! (see Table 1), obtained this way, is accurate up 
to minutes. In this table, we also give the Rsine values given by Govindasvamin 
(c. 825) in his commentary on Mahabhaskariya of Bhaskara I, and by Madhava 


?8pbid.. comm. on verse 2.4. 

2°Delambre, Historie de I’ Astronomie Ancienne, t 1, Paris 1817, pp. 457, 459f, cited from 
B. B. Datta and A. N. Singh, “Hindu Trigonometry’, Ind. Jour. Hist. Sc. 18, 39-108, 1983, p. 77. 

30 4 ryabhatiya, cited above, Gitikapada 12, p. 29. 

31See, for instance, A. K. Bag, Mathematics in Ancient and Medieval India, Varanasi 1979, 
pp. 247-48. 
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(c. 1340-1420) as recorded in the Aryabhatiya-bhasya (c. 1502) of Nilakantha 
Somayaji. Though Govindasvamin gives the Rsine values up to the thirds, his 
values are accurate only up to the seconds; those of Madhava are accurate up to 


the thirds. 


Arc-length | Aryabhata | Govindasvamin     | Madhava
           | (c. 499)  | (c. 825)          | (c. 1375)

 3°45'     |  225'     |  224' 50" 23'''   |  224' 50" 22'''
 7°30'     |  449'     |  448' 42" 53'''   |  448' 42" 58'''
11°15'     |  671'     |  670' 40" 11'''   |  670' 40" 16'''
15°00'     |  890'     |  889' 45" 08'''   |  889' 45" 15'''
18°45'     | 1105'     | 1105' 01" 30'''   | 1105' 01" 39'''
22°30'     | 1315'     | 1315' 33" 56'''   | 1315' 34" 07'''
26°15'     | 1520'     | 1520' 28" 22'''   | 1520' 28" 35'''
30°00'     | 1719'     | 1718' 52" 10'''   | 1718' 52" 24'''
33°45'     | 1910'     | 1909' 54" 19'''   | 1909' 54" 35'''
37°30'     | 2093'     | 2092' 45" 46'''   | 2092' 46" 03'''
41°15'     | 2267'     | 2266' 38" 44'''   | 2266' 39" 50'''
45°00'     | 2431'     | 2430' 50" 54'''   | 2430' 51" 15'''
48°45'     | 2585'     | 2584' 37" 43'''   | 2584' 38" 06'''
52°30'     | 2728'     | 2727' 20" 29'''   | 2727' 20" 52'''
56°15'     | 2859'     | 2858' 22" 31'''   | 2858' 22" 55'''
60°00'     | 2978'     | 2977' 10" 09'''   | 2977' 10" 34'''
63°45'     | 3084'     | 3083' 12" 51'''   | 3083' 13" 17'''
67°30'     | 3177'     | 3176' 03" 23'''   | 3176' 03" 50'''
71°15'     | 3256'     | 3255' 17" 54'''   | 3255' 18" 22'''
75°00'     | 3321'     | 3320' 36" 02'''   | 3320' 36" 30'''
78°45'     | 3372'     | 3371' 41" 01'''   | 3371' 41" 29'''
82°30'     | 3409'     | 3408' 19" 42'''   | 3408' 20" 11'''
86°15'     | 3431'     | 3430' 22" 42'''   | 3430' 23" 11'''
90°00'     | 3438'     | 3437' 44" 19'''   | 3437' 44" 48'''


TABLE 1. Rsine-table of Aryabhata, Govindasvamin and Madhava. 
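
The claim about the relative accuracy of the two later tables can be spot-checked numerically. The sketch below converts three rows of Table 1 from minutes, seconds and thirds into decimal minutes and compares them with R sin θ, assuming R = 21600/2π minutes; the function names and the choice of rows are illustrative only.

```python
import math

# Assumed radius in minutes of arc (circumference 21600' divided by 2*pi).
R = 21600 / (2 * math.pi)


def sexagesimal_to_minutes(minutes, seconds, thirds):
    """Convert minutes, seconds and thirds of arc to decimal minutes."""
    return minutes + seconds / 60 + thirds / 3600


# Three rows of Table 1: arc in degrees -> (Govindasvamin, Madhava).
ROWS = {
    15.0: ((889, 45, 8), (889, 45, 15)),
    45.0: ((2430, 50, 54), (2430, 51, 15)),
    90.0: ((3437, 44, 19), (3437, 44, 48)),
}


def errors(arc_degrees):
    """Absolute errors, in minutes, of the two tabulated values for one arc."""
    exact = R * math.sin(math.radians(arc_degrees))
    govindasvamin, madhava = ROWS[arc_degrees]
    return (abs(sexagesimal_to_minutes(*govindasvamin) - exact),
            abs(sexagesimal_to_minutes(*madhava) - exact))


if __name__ == "__main__":
    for arc in ROWS:
        g_err, m_err = errors(arc)
        print(f"{arc:5.1f} deg  Govindasvamin {g_err:.5f}'  Madhava {m_err:.5f}'")
```

For these rows the Madhava values differ from the modern ones by less than one third of arc (1/3600 of a minute), while the Govindasvamin values agree to within one second.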


5.5. Brahmagupta’s second-order interpolation formula 


The Rsine table of Aryabhata gives only the Rsine values for the twenty-four
multiples of 225′. The Rsines for arbitrary arc-lengths have to be found by
interpolation only. In his Khandakhadyaka (c. 665), Brahmagupta gives a
second-order interpolation formula for the computation of Rsines for arbitrary arcs. In this
work, which is in the form of a manual (karana) for astronomical calculations,
Brahmagupta uses a simpler Rsine-table which gives Rsines only at intervals of
15° or 900′:³²


[Sanskrit verse in Devanagari; not legible in this copy.]


Multiply the residual arc after division by 900′ by half the difference of the tabular
Rsine-differences passed over (gata-khanda) and to be passed over
(bhogya-khanda) and divide by 900′. The result is to be added to or subtracted from half
the sum of the same tabular Rsine-differences according as this [half-sum] is less
than or greater than the tabular Rsine-difference to be passed over. What results is the true
Rsine-difference to be passed over.


Let $h$ be the basic unit of arc in terms of which the Rsine-table is constructed,
which happens to be 225′ in the case of Aryabhatiya, and 900′ in the case of
Khandakhadyaka. Let the arc for which the Rsine is to be found be given by

$$ s = jh + \varepsilon \quad \text{for some } j = 0, 1, \ldots \tag{49} $$

Now $R\sin(jh) = B_j$ are the tabulated Rsines. Then, a simple interpolation
(trairasika) would yield

$$ R\sin(jh + \varepsilon) = B_j + \left(\frac{\varepsilon}{h}\right)(B_{j+1} - B_j)
   = R\sin(jh) + \left(\frac{\varepsilon}{h}\right)\Delta_{j+1}. \tag{50} $$


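Instead of the above simple interpolation, Brahmagupta prescribes

$$ R\sin(jh + \varepsilon) = B_j + \left(\frac{\varepsilon}{h}\right)
   \left[\frac{1}{2}(\Delta_j + \Delta_{j+1})
   \pm \left(\frac{\varepsilon}{h}\right)\frac{\lvert \Delta_j - \Delta_{j+1} \rvert}{2}\right]. \tag{51} $$

Here, the sign is chosen to be positive if $\Delta_j < \Delta_{j+1}$, and negative if
$\Delta_j > \Delta_{j+1}$ (as in the case of Rsine). So Brahmagupta's rule is actually
the second-order interpolation formula

$$ \begin{aligned}
R\sin(jh + \varepsilon)
 &= R\sin(jh) + \left(\frac{\varepsilon}{h}\right)\frac{1}{2}(\Delta_j + \Delta_{j+1})
    - \left(\frac{\varepsilon}{h}\right)^{2}\frac{(\Delta_j - \Delta_{j+1})}{2} \\
 &= R\sin(jh) + \left(\frac{\varepsilon}{h}\right)\frac{(\Delta_{j+1} + \Delta_j)}{2}
    + \frac{\varepsilon^{2}}{2h^{2}}(\Delta_{j+1} - \Delta_j) \\
 &= R\sin(jh) + \left(\frac{\varepsilon}{h}\right)\Delta_{j+1}
    + \left(\frac{\varepsilon}{h}\right)\left[\frac{\varepsilon}{h} - 1\right]
      \frac{(\Delta_{j+1} - \Delta_j)}{2}.
\end{aligned} \tag{52} $$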
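
The difference between the simple rule (50) and the second-order rule (52) can be illustrated numerically. The sketch below is only that: it assumes a table at intervals of 900′ generated from the modern sine function with an assumed radius R = 3438, not Brahmagupta's own table, and the function names are hypothetical.

```python
import math

R = 3438.0   # assumed radius in minutes
H = 900.0    # tabular interval in minutes (15 degrees)

# B[j] = R sin(j * 900'), generated with the modern sine purely for illustration.
B = [R * math.sin(math.radians(15 * j)) for j in range(7)]


def _split(arc):
    """Write arc = j*H + eps and check that the neighbouring entries exist."""
    j, eps = divmod(arc, H)
    j = int(j)
    if not 1 <= j <= 5:
        raise ValueError("this sketch needs 900' <= arc < 5400'")
    return j, eps


def linear_rsine(arc):
    """Simple interpolation (trairasika), eq. (50)."""
    j, eps = _split(arc)
    return B[j] + (eps / H) * (B[j + 1] - B[j])


def second_order_rsine(arc):
    """Second-order rule in the form of the second line of eq. (52)."""
    j, eps = _split(arc)
    gata = B[j] - B[j - 1]       # difference passed over, Delta_j
    bhogya = B[j + 1] - B[j]     # difference to be passed over, Delta_{j+1}
    t = eps / H
    return B[j] + t * (gata + bhogya) / 2 + t * t * (bhogya - gata) / 2


if __name__ == "__main__":
    arc = 1200.0  # 20 degrees
    exact = R * math.sin(math.radians(arc / 60))
    print(linear_rsine(arc), second_order_rsine(arc), exact)
```

With ε = 0 the rule returns the tabulated value, and with ε = h it returns the next tabulated value, so the interpolant passes through the table entries at both ends of the interval.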


32 Khandakhadyaka of Brahmagupta, Ed. by P. C. Sengupta, Calcutta 1941, p. 151. 
6. Instantaneous velocity of a planet (tatkalika-gati)


6.1. True daily motion of a planet


In Indian Astronomy, the motion of a planet is computed by making use of two
corrections: the manda-samskara which essentially corresponds to the equation
of centre and the sighra-samskara which corresponds to the conversion of the
heliocentric longitudes to geocentric longitudes. The manda correction for planets
is given in terms of an epicycle of variable radius $r$, which varies in such a way
that

$$ \frac{r}{K} = \frac{r_0}{R}, \tag{53} $$

where $K$ is the karna (hypotenuse) or the (variable) distance of the planet from
the centre of the concentric and $r_0$ is the tabulated (or mean) radius of the epicycle
in the measure of the concentric circle of radius $R$.


FIGURE 4. Manda correction.


In Figure 4, C is the centre of the concentric on which the mean planet $P_0$ is
located. CU is the direction of the ucca (aphelion or apogee as the case may be).
P is the true planet which lies on the epicycle of (variable) radius $r$ centered at
$P_0$, such that $P_0P$ is parallel to CU. If $M$ is the mean longitude of a planet, $\alpha$ the
longitude of the ucca, then the correction (manda-phala) $\Delta\mu$ is given by

$$ R\sin(\Delta\mu) = \left(\frac{r}{K}\right) R\sin(M - \alpha)
   = \left(\frac{r_0}{R}\right) R\sin(M - \alpha). \tag{54} $$

For small $r$, the left hand side is usually approximated by the arc itself. The
manda-correction is to be applied to the mean longitude $M$, to obtain the true or
manda-corrected longitude $\mu$ given by

$$ \mu = M - \left(\frac{r_0}{R}\right)\left(\frac{1}{R}\right) R\sin(M - \alpha). \tag{55} $$

If $n_m$ and $n_u$ are the mean daily motions of the planet and the ucca, then the true
longitude on the next day is given by

$$ \mu + n = (M + n_m) - \left(\frac{r_0}{R}\right)\left(\frac{1}{R}\right)
   R\sin(M + n_m - \alpha - n_u). \tag{56} $$

The true daily motion is thus given by

$$ n = n_m - \left(\frac{r_0}{R}\right)\left(\frac{1}{R}\right)
   \left[R\sin\{(M - \alpha) + (n_m - n_u)\} - R\sin(M - \alpha)\right]. \tag{57} $$

The second term in the above is the correction to mean daily motion (gati-phala).
An expression for this was given by Bhaskara I (c. 629) in Mahabhaskariya,
where he makes use of the approximation:³³

$$ R\sin\{(M - \alpha) + (n_m - n_u)\} - R\sin(M - \alpha)
   = (n_m - n_u)\left(\frac{1}{225}\right) \times \text{Rsine-difference at } (M - \alpha). \tag{58} $$

In the above approximation, $(n_m - n_u)$ is multiplied by the tabular Rsine-difference
at the 225′ arc-bit in which (the tip of the arc) $(M - \alpha)$ is located. Therefore,
under this approximation, as long as the anomaly (kendra), $(M - \alpha)$, is in the
same multiple of 225′, there will be no change in the gati-phala or the correction
to the mean velocity. This defect was noticed by Bhaskara also in his later work
Laghubhaskariya:³⁴


[Sanskrit verse in Devanagari; not legible in this copy.]

Whilst the Sun or the Moon moves in the [same] element of arc, there is no change
in the rate of motion (bhukti), because the Rsine-difference does not increase or
decrease; viewed thus, the rate of motion [as given above] is defective.


33 Mahābhāskarīya of Bhāskara I, Ed. by K. S. Shukla, Lucknow 1960, verse 4.14, p. 120.
34 Laghubhaskariya of Bhāskara I, Ed. by K. S. Shukla, Lucknow 1963, verses 2.14-5, p. 6.


The correct formula for the true daily motion of a planet, employing the Rcosine
as the 'rate of change' of Rsine, seems to have been first given by Munjala (c. 932)
in his short manual Laghumanasa³⁵ and also by Aryabhata II (c. 950) in his
Maha-siddhanta:³⁶


[Sanskrit verse in Devanagari; not legible in this copy.]


The kotiphala multiplied by the [mean] daily motion and divided by the radius
gives the minutes of the correction [to the rate of the motion].


This gives the true daily motion in the form

$$ n = n_m - (n_m - n_u)\left(\frac{r_0}{R}\right)\left(\frac{1}{R}\right) R\cos(M - \alpha). \tag{59} $$
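
The three expressions (57), (58) and (59) can be compared in a short Python sketch. The parameter values used in the usage lines (a ratio r₀/R of 0.0375, a mean daily motion of 59.14′ and a stationary ucca) are illustrative assumptions, roughly Sun-like, and are not taken from the text; the function names are likewise hypothetical.

```python
import math

ARCMIN = math.pi / (180 * 60)   # one minute of arc in radians

# Aryabhata's Rsine-differences (minutes), as quoted in Section 5.4.
DIFFS = [225, 224, 222, 219, 215, 210, 205, 199, 191, 183, 174, 164,
         154, 143, 131, 119, 106, 93, 79, 65, 51, 37, 22, 7]


def motion_by_differencing(kendra, n_m, n_u, ratio):
    """Eq. (57): difference of the corrected longitudes one day apart (radians)."""
    return n_m - ratio * (math.sin(kendra + (n_m - n_u)) - math.sin(kendra))


def motion_by_rcosine(kendra, n_m, n_u, ratio):
    """Eq. (59): Rcosine form of the true daily motion (radians)."""
    return n_m - (n_m - n_u) * ratio * math.cos(kendra)


def gati_phala_tabular(kendra_arcmin, rate_arcmin, ratio):
    """Eq. (58): correction from the tabular difference of the current arc-bit.

    Valid for an anomaly in the first quadrant (0 <= kendra < 5400');
    the result is in minutes per day.
    """
    bit = int(kendra_arcmin // 225)
    return ratio * rate_arcmin * DIFFS[bit] / 225


if __name__ == "__main__":
    ratio, n_m, n_u = 0.0375, 59.14 * ARCMIN, 0.0   # assumed values
    k = math.radians(45)
    print(motion_by_differencing(k, n_m, n_u, ratio) / ARCMIN)
    print(motion_by_rcosine(k, n_m, n_u, ratio) / ARCMIN)
    # Same arc-bit, hence the same tabular correction:
    print(gati_phala_tabular(230, 59.14, ratio), gati_phala_tabular(440, 59.14, ratio))
```

The tabular correction stays fixed while the anomaly remains inside one 225′ arc-bit, which is the defect described above, whereas the Rcosine form changes continuously with the anomaly.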


6.2. The notion of instantaneous velocity (tatkalikagati) according to
Bhaskaracarya II


Bhaskaracarya II (c. 1150) in his Siddhantasiromani clearly distinguishes the
true daily motion from the instantaneous rate of motion. And he gives the Rcosine
correction to the mean rate of motion as the instantaneous rate of motion. He
further emphasizes the fact that the velocity is changing every instant and this is
particularly important in the case of Moon because of its rapid motion.³⁷


[Sanskrit verses in Devanagari; not legible in this copy.]


The true daily motion of a planet is the difference between the true longitudes on
successive days. And it is accurate (sphuta) over that period. The kotiphala
(Rcosine of anomaly) is multiplied by the rate of motion of the manda-anomaly
(mrdu-kendra-bhukti) and divided by the radius. The result added or subtracted
from the mean rate of motion of the planet, depending on whether the anomaly is
in Karkyadi or Mrgadi, gives the true instantaneous rate of motion (tatkaliki
manda-sphutagati) of the planet.


In the case of the Moon, the ending moment of a tithi³⁸ which is about to end or
the beginning time of a tithi which is about to begin, are to be computed with the
instantaneous rate of motion at the given instant of time. The beginning moment
of a tithi which is far away can be calculated with the earlier [daily] rate of
motion. This is because Moon's rate of motion is large and varies from moment
to moment.


35 Laghumanasa of Munjala, Ed. by K. S. Shukla, New Delhi 1990, verse 3.4, p. 125.
36 Mahasiddhanta of Aryabhata II, Ed. by Sudhakara Dvivedi, Varanasi 1910, verse 3.15, p. 58.
37 Siddhantasiromani of Bhaskaracarya, Ed. by Muralidhara Chaturvedi, Varanasi 1981,
verses 2.36-8, p. 119.


Here, Bhaskara explains the distinction between the true daily rate of motion
and the true instantaneous rate of motion. The former is the difference between
the true longitudes on successive days and it is accurate as the rate of motion, on
the average, for the entire period. The true instantaneous rate of motion is to be
calculated from the Rcosine of the anomaly (kotiphala) for each relevant moment.

Thus if $\omega_m$ and $\omega_u$ are the rates of the motion of the mean planet and the ucca,
then $\omega_m - \omega_u$ is the rate of motion of the anomaly, and the true instantaneous rate
of motion of the planet at any instant is given by Bhaskara to be

$$ \omega = \omega_m + (\omega_m - \omega_u)\left(\frac{r_0}{R}\right)\left(\frac{1}{R}\right)
   R\cos(M - \alpha), \tag{60} $$

where $(M - \alpha)$ is the anomaly of the planet at that instant.
Bhaskara explains the idea of the instantaneous velocity even more clearly in
his Vasana:³⁹


[Sanskrit passage in Devanagari; not legible in this copy.]


38 Tithi is the time taken by the Moon to lead the Sun exactly by 12° in longitude.
39 Siddhantasiromani, cited above, Vasana on 2.36-38, pp. 119-20.
The true daily velocity is the difference in minutes etc., between the true planets
of today and tomorrow, either at the time of sunrise, or mid-day or sunset. If
tomorrow's longitude is smaller than that of today, then we should understand the
motion to be retrograde. It is said "over that period". This only means that,
during that intervening period, the planet is to move with this rate [on the average].
This is only a rough or approximate rate of motion. Now we shall discuss the
instantaneous rate of motion... In this way, the manda-corrected true
instantaneous rate of motion (tatkaliki manda-parisphutagati) is calculated. In the
case of Moon, this instantaneous rate of motion is especially useful... Because of
its largeness, the rate of motion of Moon is not the same every instant. Hence, in
the case of Moon, the special [instantaneous] rate of motion is stated.


Then, the justification for the correction to the rate of motion
(gati-phala-vasana)... The rate of motion of the anomaly is the difference in the anomalies of
today and tomorrow. That should be multiplied by the [current] Rsine-difference
used in the computation of Rsines and divided by 225. Now, the following rule
of three to obtain the instantaneous Rsine-difference: If the first Rsine-difference
225 results when the Rcosine is equal to the radius, then how much is it for the
given Rcosine. In this way, the Rcosine is to be multiplied by 225 and divided
by the radius. The result is the instantaneous Rsine-difference and that should be
multiplied by the rate of motion in the anomaly and divided by 225...


Thus, Bhaskara is here conceiving also of an instantaneous Rsine-difference,
though his derivation of the instantaneous velocity is somewhat obscure. These
ideas are more clearly set forth in the Aryabhatiya-bhasya (c. 1502) of Nilakantha
Somayaji and other works of the Kerala School.
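
The rule of three quoted from the Vasana (Rcosine multiplied by 225 and divided by the radius) can be set beside the tabular Rsine-differences. In the sketch below the instantaneous difference is evaluated at the middle of each 225′ arc-bit; that choice of evaluation point, the radius R = 3438 and the function names are assumptions made for illustration.

```python
import math

R = 3438  # radius in minutes

# Tabular Rsine-differences (minutes), as quoted in Section 5.4.
DIFFS = [225, 224, 222, 219, 215, 210, 205, 199, 191, 183, 174, 164,
         154, 143, 131, 119, 106, 93, 79, 65, 51, 37, 22, 7]


def instantaneous_difference(arc_arcmin):
    """Rule of three from the Vasana: Rcosine * 225 / R."""
    rcosine = R * math.cos(math.radians(arc_arcmin / 60))
    return rcosine * 225 / R


def midpoint_comparison():
    """Pairs (tabular difference, instantaneous difference at mid arc-bit)."""
    return [(d, instantaneous_difference(225 * (j + 0.5)))
            for j, d in enumerate(DIFFS)]


if __name__ == "__main__":
    for tabular, instantaneous in midpoint_comparison():
        print(f"{tabular:4d}  {instantaneous:8.2f}")
```

Multiplying this instantaneous difference by the rate of motion of the anomaly and dividing by 225 gives the rate times Rcosine over R, which is the form of the correction in (59) and (60).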


6.3. The sighra correction to the velocity and the condition for retrograde
motion


Bhaskara then goes on to derive the correct expression for the true rate of motion
as corrected by the sighra-correction. In the language of modern astronomy, the
sighra-correction converts the heliocentric longitude of the planets to the
geocentric longitudes. Here also, the Indian astronomers employ an epicycle, but with a
fixed radius, unlike in the case of the manda-correction.

If $\mu$ is the manda-corrected (manda-sphuta) longitude of the planet, $\zeta$ is the
longitude of the sighrocca, and $r_s$ the radius of the sighra-epicycle, then the
correction (sighra-phala) $\Delta\sigma$ is given by

$$ R\sin(\Delta\sigma) = \left(\frac{r_s}{K}\right) R\sin(\mu - \zeta), \tag{61} $$

where $(\mu - \zeta)$ is the sighrakendra and $K$ is the hypotenuse (sighrakarna) given
by

$$ K^2 = R^2 + r_s^2 - 2Rr_s\cos(\mu - \zeta). \tag{62} $$

The calculation of the sighra-correction to the velocity is indeed much more
difficult, since the denominator in (61), the hypotenuse, itself depends on
the anomaly and therefore also varies with time in a complex way. This has been noted by
Bhaskara who was able to obtain the correct form of the sighra-correction to the
velocity (sighra-gati-phala) in an ingenious way.⁴⁰


[Sanskrit verse in Devanagari; not legible in this copy.]


The Rsine of ninety degrees, less the degrees of sighra-correction for the
longitude (sighra-phala), should be multiplied by the rate of motion of the
sighra-anomaly (drak-kendra-bhukti) and divided by the hypotenuse (sighra-karna).
This, subtracted from the rate of motion of the sighrocca, gives the true velocity
of the planet. If this is negative, the planet's motion is retrograde.


If $\omega$ is the rate of motion of the manda-corrected planet and $\omega_s$ is the rate of
motion of the sighrocca, then the rate of motion of the sighra-anomaly is $(\omega_s - \omega)$,
and the true velocity of the planet $\omega_t$ is given by

$$ \omega_t = \omega_s - \left[\frac{(\omega_s - \omega) R\cos\Delta\sigma}{K}\right]. \tag{63} $$

The details of the ingenious argument given by Bhaskara for deriving the correct
form (63) of the sighra-correction to the velocity have been outlined by
D. Arkasomayaji in his translation of Siddhantasiromani.⁴¹

Since Bhaskara's derivation is somewhat long-winded, here we shall present
a modern derivation of the result just to demonstrate that the expression given by
Bhaskara is indeed exact.


40 Siddhantasiromani, cited above, verse 2.39, p. 121.


FIGURE 5a. Velocity of a planet as seen from the Earth. 


In Figure 5a, S, E and P represent the positions of the Sun, Earth and an
exterior planet respectively. Let $v$ and $v_s$ be the linear velocities of the planet and
the Earth with respect to the Sun. PP′ and EE′ are lines perpendicular to the line
EP joining the Earth to the planet. Let $R$, $r$ represent the radii of the orbits of
the planet and the Earth (assumed to be circular) around the Sun respectively and
$K$, the distance of the planet from the Earth. For an exterior planet, the
sighra-correction $\Delta\sigma$ is given by the angle SPE.


41 D. Arkasomayaji, Siddhantasiromani of Bhaskaracarya, Tirupati 1980, pp. 157-161.
If $v_t$ is the linear velocity of the planet as seen from the Earth, then the angular
velocity is given by

$$ \omega_t = \frac{v_t}{K}. \tag{64} $$

The magnitude of $v_t$ in terms of $v$ and $v_s$ (for the situation depicted in Figure 5a)
is

$$ v_t = v\cos\Delta\sigma + v_s\cos\theta. \tag{65} $$

Also, from the triangle SEP, the distance of the planet from the Earth, known as
karna and denoted $K$ in the figure, may be expressed as

$$ K = R\cos\Delta\sigma + r\cos\theta, \quad\text{or}\quad
   \cos\theta = \frac{K - R\cos\Delta\sigma}{r}. \tag{66} $$

Using (66) in (65) we have

$$ \begin{aligned}
v_t &= v\cos\Delta\sigma + \frac{v_s}{r}\,(K - R\cos\Delta\sigma) \\
    &= v_s\,\frac{K}{r} + \cos\Delta\sigma\left(v - v_s\,\frac{R}{r}\right), \\
\text{or}\quad \frac{v_t}{K} &= \frac{v_s}{r}
      + \frac{\cos\Delta\sigma\left(v - v_s\,\frac{R}{r}\right)}{K}.
\end{aligned} \tag{67} $$

Making use of (64) and the fact that $v = R\omega$ and $v_s = r\omega_s$, the above equation
reduces to

$$ \omega_t = \omega_s - \frac{(\omega_s - \omega) R\cos\Delta\sigma}{K}, $$

which is the same as the expression given by Bhaskara (63).
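
The exactness of (63) for circular orbits can also be checked numerically. The sketch below compares the formula with a central-difference derivative of the geocentric longitude in a simple heliocentric model; the orbital values used in the usage lines (radii 5.2 and 1, periods 11.86 and 1 in arbitrary units) are illustrative assumptions, and all function names are hypothetical.

```python
import math


def geocentric_longitude(t, R, r, omega, omega_s, phase=0.0):
    """Longitude of the planet seen from the Earth; circular orbits about the Sun."""
    lp = omega * t + phase     # heliocentric longitude of the planet
    le = omega_s * t           # heliocentric longitude of the Earth
    return math.atan2(R * math.sin(lp) - r * math.sin(le),
                      R * math.cos(lp) - r * math.cos(le))


def bhaskara_rate(t, R, r, omega, omega_s, phase=0.0):
    """Eq. (63): omega_t = omega_s - (omega_s - omega) R cos(Delta sigma) / K."""
    d = (omega * t + phase) - omega_s * t               # angle ESP at the Sun
    K = math.sqrt(R * R + r * r - 2 * R * r * math.cos(d))
    cos_dsigma = (R - r * math.cos(d)) / K              # cosine of the angle SPE
    return omega_s - (omega_s - omega) * R * cos_dsigma / K


def numerical_rate(t, R, r, omega, omega_s, phase=0.0, dt=1e-5):
    """Central difference of the geocentric longitude, unwrapped across +/- pi."""
    a = geocentric_longitude(t - dt, R, r, omega, omega_s, phase)
    b = geocentric_longitude(t + dt, R, r, omega, omega_s, phase)
    return math.atan2(math.sin(b - a), math.cos(b - a)) / (2 * dt)


if __name__ == "__main__":
    R, r = 5.2, 1.0                                     # assumed radii
    omega, omega_s = 2 * math.pi / 11.86, 2 * math.pi   # assumed rates
    for t in (0.0, 0.25, 0.5):
        print(t, bhaskara_rate(t, R, r, omega, omega_s),
              numerical_rate(t, R, r, omega, omega_s))
```

With these assumed values the rate is negative at t = 0, when the Earth lies between the Sun and the planet, which corresponds to the condition for retrograde motion stated in the verse.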

Bhaskara in his Vasana⁴² explains as to why in the sighra process a different
procedure for finding the rate of motion of the planet has to be employed than the
one used in the manda process:


[Sanskrit passage in Devanagari; not legible in this copy.]


42 Ibid., Vasana on 2.39.


Here is the justification. The sighra-correction to the rate of motion is the
difference between the sighra-phalas of today and tomorrow. If that is derived in
the same way as the manda-correction to the rate of motion, the result will be
incorrect even if it were to be divided by the hypotenuse (sighra-karna)... The
difference is not just due to the change in the anomaly [which is the argument 
of the Rsine] but also otherwise... The result of dividing by today’s hypotenuse 
is different from that of dividing by that of tomorrow. Even if the hypotenuses 
turn out to differ by small amount, the quantities they divide are large and thus 
a large difference could result. Hence, this way of approach [which was adopted 
in the case of manda-correction to the rate of the motion] has been forsaken and 
another has been devised by the great intellects. That is as follows... 


6.4. The equation of centre is extremum when the velocity correction
vanishes


Later, in the Goladhyaya of Siddhantasiromani, Bhaskara considers the
situation when the correction to the velocity (gati-phala) vanishes:⁴³


[Sanskrit verse in Devanagari; not legible in this copy.]

Where the [North-South] line perpendicular to the [East-West] line of apsides
through the centre of the concentric meets the eccentric, there the mean velocity
itself is true and the equation of centre is extremum.


In his Vasana, Bhaskara explains this correlation between vanishing of the
velocity correction and the extrema of the correction to the planetary longitude:⁴⁴


[Sanskrit passage in Devanagari; not legible in this copy.]


The mean rate of motion itself is exact at the points where the line perpendicular
[to the line of apsides], at the middle of the concentric circle, meets the eccentric
circle. Because, there is no correction to the rate of motion [at those points]. Also,
because there the equation of centre [or correction to the planetary longitudes] is
extreme. Wherever the equation of centre is maximum, there the correction to the
velocity should be absent. Because, the rate of motion is the difference between
the planetary longitudes of today and tomorrow. The correction to the velocity is
the difference between the equations of centre. The place where the correction to
the velocity vanishes, there is a change over from positive to the negative. And,
what has been stated by Lalla, "the mean rate of motion is itself true when the
planet is on the intersection of the two circles [concentric and eccentric]", that is
incorrect. The planet does not have maximum equation of centre at the confluence
of the two circles.


43 Siddhantasiromani, cited above, Goladhyaya 4.39, p. 393.
44 Ibid., Vasana on Goladhyaya 4.39.


FIGURE 5b. Equation of centre is extremum where the correc- 
tion to velocity vanishes. 


Bhaskara explains that when the anomaly is ninety degrees, or the mean planet 
is at N along the line CN perpendicular to the line of apsides CE (see Figure 
5b), the equation of centre is maximum. It is precisely then that the correction to 
the velocity vanishes, as it changes sign from positive to negative. It is incorrect 
to state (as Lalla did in his Sisyadhivrddhida-tantra) that the correction to the 
velocity is zero at the point where the concentric and eccentric meet. 
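
In modern notation the statement above can be sketched as follows. This is a restatement under the small-correction form of (55), in which $\Delta\mu$ is approximated by its Rsine divided by $R$; it is offered as a reading aid and is not Bhaskara's own argument.

```latex
\[
  \Delta\mu \approx \frac{r_0}{R}\,\sin(M-\alpha),
  \qquad
  \frac{d(\Delta\mu)}{dt} \approx \frac{r_0}{R}\,\cos(M-\alpha)\,(n_m - n_u).
\]
\[
  \frac{d(\Delta\mu)}{dt} = 0
  \iff \cos(M-\alpha) = 0
  \iff M-\alpha = 90^\circ \ \text{or}\ 270^\circ ,
\]
\[
  \text{and at these points}\quad
  \lvert \Delta\mu \rvert \approx \frac{r_0}{R}
  \quad\text{is the largest value the correction can take.}
\]
```

The velocity correction is thus the rate of change of the equation of centre, so it vanishes exactly where the equation of centre is extremal and changes sign on passing through that point.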


7. Surface area and volume of a sphere 


In Aryabhatiya (Golapada 7), the volume of a sphere has been incorrectly
estimated as the product of the area of a great circle by its square-root. Sridhara
(c. 750) seems to have given the correct expression for the volume of a sphere
(Trisatika 56), though his estimate of π is fairly off the mark. Bhaskaracarya
(c. 1150) has given the correct relation between the diameter, the surface area and
the volume of a sphere in his Lilavati:⁴⁵


[Sanskrit verse in Devanagari; not legible in this copy.]


In a circle, the circumference multiplied by one-fourth the diameter is the area, 
which, multiplied by four, is its surface area going around it like a net around 
a ball. This [surface area] multiplied by the diameter and divided by six is the 
volume of the sphere. 
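
The chain of rules in this verse can be written out directly. The following Python sketch uses hypothetical function names and simply checks the verse against the modern formulas for a sphere of a given diameter.

```python
import math


def circle_area(circumference, diameter):
    """Circumference multiplied by one-fourth of the diameter."""
    return circumference * diameter / 4


def sphere_surface(circumference, diameter):
    """Four times the area of the great circle."""
    return 4 * circle_area(circumference, diameter)


def sphere_volume(circumference, diameter):
    """Surface area multiplied by the diameter and divided by six."""
    return sphere_surface(circumference, diameter) * diameter / 6


if __name__ == "__main__":
    d = 2.0
    c = math.pi * d
    print(circle_area(c, d), sphere_surface(c, d), sphere_volume(c, d))
```

For a diameter d and circumference πd these give πd²/4, πd² and πd³/6, in agreement with the modern expressions.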


The surface area and volume of a sphere have been discussed in greater detail
in the Siddhantasiromani (Golādhyāya 2.53-61), where Bhāskara has also
presented the upapatti or justification for the results in his commentary Vāsanā. As
regards the surface area of the sphere, Bhaskara argues as follows:⁴⁶


[Sanskrit passage in Devanagari; not legible in this copy.]


In order to make the point clear to a beginner, the teacher should demonstrate it
on the surface of a sphere. Make a model of the earth in clay or wood and let its
circumference be 21,600 minutes. From the point at the top of the sphere at an
arc-distance of 1/96th of the circumference, i.e., 225′, draw a circle. Similarly
draw circles with twice, thrice,... twenty-four times 225′ [as the arc-distances] so
that there will be twenty-four circles. These circles will have as their radii Rsines
starting from 225′. The measure [circumference] of the circle will be in
proportion to these radii. Here, the last circle has a circumference 21,600′ and its radius
is 3,438′. The Rsines multiplied by 21,600 and divided by the radius [3,438]
will give the [circumference] measure of the circles. Between any two circles,
there is an annular region and there are twenty-four of them. If more [than 24]
Rsines are used, then there will be as many regions. In each figure [if it is cut and
spread across as a trapezium] the larger lower circle may be taken as the base and
the smaller upper circle as the face and 225′ as the altitude and the area calculated
by the usual rule: [Area is] altitude multiplied by half the sum of the base and
face. The sum of all these areas is the area of half the sphere. Twice that will be
the surface area of the entire sphere. That will always be equal to the product of
the diameter and the circumference.


45 Lilavati, cited above (fn. 5), verse 203, pp. 79-80.
46 Siddhantasiromani, cited above, Vasana on Goladhyaya 2.57, p. 362.


Here Bhaskara is taking the circumference to be $C = 21{,}600'$, and the
corresponding radius is approximated as $R \approx 3{,}438'$. As shown in Figure 6, circles
are drawn parallel to the equator of the sphere, each separated in latitude by 225′.
This divides the northern hemisphere into 24 strips, each of which can be cut and
spread across as a trapezium. If we denote the 24 tabulated Rsines by $B_1, B_2, \ldots,
B_{24}$, then the area $A_j$ of the $j$-th trapezium will be

$$ A_j = \left(\frac{C}{R}\right)\frac{(B_j + B_{j+1})}{2}\,(225). $$

Therefore, the surface area $S$ of the sphere is estimated to be

$$ S = 2\left(\frac{C}{R}\right)\left[B_1 + B_2 + \ldots + B_{23} + \left(\frac{B_{24}}{2}\right)\right](225). \tag{68} $$

Now, Bhaskara states that the right hand side of the above equation reduces to
$2CR$. This can be checked by using Bhaskara's Rsine-table. Bhaskara himself
has done the summation of the Rsines in his Vasana on the succeeding verses,⁴⁷
where he gives another method of derivation of the area of the sphere, by cutting
the surface of the sphere into lunes. In that context, he computes the sum

$$ B_1 + B_2 + \ldots + B_{23} + \left(\frac{B_{24}}{2}\right)
   = B_1 + B_2 + \ldots + B_{23} + B_{24} - \left(\frac{R}{2}\right)
   = 54233 - 1719 = 52514. \tag{69} $$


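47 Siddhantasiromani, cited above, Vasana on Goladhyaya 2.58-61, p. 364.

FIGURE 6. Surface area of a sphere.


Thus, according to Bhaskara's Rsine table

$$ \begin{aligned}
S &= 2\left(\frac{C}{R}\right)\left[B_1 + B_2 + \ldots + B_{23} + \left(\frac{B_{24}}{2}\right)\right](225) \\
  &= 2\left(\frac{C}{R}\right) 52514 \times (225) \\
  &= 2\left(\frac{C}{R}\right)(11815650) \\
  &\approx 2\left(\frac{C}{R}\right)(3437.39)^2.
\end{aligned} \tag{70} $$

Taking this as $R^2 = (3438)^2$, we obtain the surface area of the sphere to be

$$ S = 2\left(\frac{C}{R}\right)R^2 = 2CR. \tag{71} $$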
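
The arithmetic in (69) and (70) can be retraced in a few lines. The sketch below assumes that the Rsines in question are the running sums of the 24 differences quoted in Section 5.4, which reproduces the total 54233 stated in the text; the function names are illustrative.

```python
import math
from itertools import accumulate

# Rsine-differences (minutes) quoted in Section 5.4; their running sums are B_1..B_24.
DIFFS = [225, 224, 222, 219, 215, 210, 205, 199, 191, 183, 174, 164,
         154, 143, 131, 119, 106, 93, 79, 65, 51, 37, 22, 7]
RSINES = list(accumulate(DIFFS))


def trapezium_sum(rsines):
    """B_1 + ... + B_23 + B_24 / 2, the bracket in eq. (68)."""
    return sum(rsines) - rsines[-1] // 2


def effective_radius(rsines, step=225):
    """Square root of the bracket times 225, to be compared with R = 3438."""
    return math.sqrt(trapezium_sum(rsines) * step)


if __name__ == "__main__":
    print(sum(RSINES), trapezium_sum(RSINES), trapezium_sum(RSINES) * 225)
    print(effective_radius(RSINES))
```

The bracket times 225 falls short of (3438)² by less than one part in a thousand, which is the size of the approximation made in passing from (70) to (71).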


Of course, the grossness of the result (70) is due to the fact that the quadrant of the 
circumference was divided into only 24 bits. Bhaskara also mentions that we may 
consider dividing the circumference into many more arc-bits, instead of the usual 
24 divisions which are made for computing Rsine-tables. This is the approach 
taken in Yuktibhasa, where the circumference of the circle is divided into a large 


48 As has been remarked by one of the reviewers, it is indeed intriguing that Bhaskara chose to
sum the tabular Rsines numerically, instead of making use of the relation between Rsines and
Rcosine-differences which was well known since the time of Aryabhata. In fact, the proof given in
Yuktibhasa (cited below in fn. 49) makes use of the relation between the Rsines and the second order
Rsine-differences to estimate this sum.




number, $n$, of equal arc-bits. If $\Delta$ is the Rsine of each arc-bit, the surface area is
estimated to be

$$ S = 2\left(\frac{C}{R}\right)(B_1 + B_2 + \ldots + B_n)(\Delta). \tag{72} $$

Then it is shown that in the limit of large $n$,

$$ (B_1 + B_2 + \ldots + B_n)(\Delta) \approx R^2, \tag{73} $$

which leads to the result $2CR$ for the surface area.⁴⁹
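
The behaviour of the sum in (73) as the number of arc-bits grows can be illustrated numerically. The sketch below uses the modern sine function and an assumed radius R = 3438; it only illustrates the limit and does not follow the argument of Yuktibhasa, and the function name is hypothetical.

```python
import math


def strip_sum_ratio(n, R=3438.0):
    """(B_1 + ... + B_n) * Delta / R^2 for a quadrant cut into n equal arc-bits."""
    theta = (math.pi / 2) / n          # one arc-bit, in radians
    delta = R * math.sin(theta)        # Rsine of one arc-bit
    total = sum(R * math.sin(j * theta) for j in range(1, n + 1))
    return total * delta / (R * R)


if __name__ == "__main__":
    for n in (24, 96, 1000, 10000):
        print(n, strip_sum_ratio(n))
```

The ratio exceeds 1 by a few percent for 24 arc-bits and approaches 1 as n increases, consistent with the remark that the grossness of (70) comes from the coarse division of the quadrant.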
As regards the volume of a sphere, Bhaskara's justification is much simpler:⁵⁰


The surface area of a sphere multiplied by its diameter and divided by six is its 
volume. Here is the justification. As many pyramids as there are units in the sur- 
face area with bases of unit side and altitude equal to the semi-diameter should 
be imagined on the surface of the sphere. The apices of the pyramids meet at the 
centre of the sphere. Then the volume of the sphere is the sum of the volumes 
of the pyramids and thus our result is justified. The view that the volume is the
product of the area times its own root, is perhaps an alien view (paramata) that
has been presented by Caturavedacarya [Prthudakasvamin].


We may note that it is the Aryabhatiya rule which is referred to as paramata
in the above passage. Bhaskara's derivation of the volume of a sphere is similar
to that of the area of a circle by approximating it as the sum of the areas of a
large number of triangles with their vertices at the centre, which is actually the
proof given in Yuktibhasa. In the case of the volume of a sphere, Yuktibhasa,
however, gives the more "standard" derivation, where the sphere is divided into
a large number of slices and the volume is found as the sum of the volumes of
the slices, which ultimately involves estimating the sum of squares of natural
numbers (varga-sankalita), $1^2 + 2^2 + 3^2 + \ldots + n^2$, for large $n$.⁵¹


49 Ganita-yukti-bhasa, cited above, Section 7.18, pp. 140-42, 261-63, 465-67. In modern
terminology, this amounts to the evaluation of the integral
$\int_0^{\pi/2} R\sin\theta \, R\,d\theta = R^2$.
50 Siddhantasiromani, cited above, Vasana on Goladhyaya 2.61, p. 364.
51 Ganita-yukti-bhasa, cited above, Section 7.19, pp. 142-45, 263-66, 468-70.
PART II : WORK OF THE KERALA SCHOOL


Madhava to Sankara Variyar (c. 1350-1550 CE) 


8. Kerala School of Astronomy 


The Kerala School of Astronomy in the medieval period, pioneered by Madhava
(c. 1340-1420) of Sangamagrama, extended well into the 19th century as
exemplified in the work of Sankaravarman (c. 1830), Raja of Kadattanadu. Only
a couple of astronomical works of Madhava (Venvaroha and Sphutacandrapti)
seem to be extant now. Most of his celebrated mathematical discoveries, such as
the infinite series for π and the sine and cosine functions, are available only in
the form of citations in later works.

Madhava's disciple Paramesvara (c. 1380-1460) of Vatasseri, is reputed to 
have carried out detailed observations for over 50 years. A large number of orig- 
inal works and commentaries written by him have been published. However, his 
most important work on mathematics, the commentary Vivarana on Lilavati of 
Bhaskara II, is yet to be published. 

Nilakantha Somayaji (c. 1444-1550) of Kundagrama, disciple of
Paramesvara's son Damodara (c. 1410-1520), is the most celebrated member
of the Kerala School after Madhava. Nilakantha has cited several important
results of Madhava in his various works, the most prominent of them being
Tantrasangraha (c. 1500) and Aryabhatiya-bhasya. In the latter work, while
commenting on Ganitapada of Aryabhatiya, Nilakantha has also dealt
extensively with many important mathematical issues.

However, the most detailed exposition of the work of the Kerala School, start- 
ing from Madhava, and including the seminal contributions of Paramesvara, 
Damodara and Nilakantha, is to be found in the famous Malayalam work Yuk- 
tibhasa (c. 1530) of Jyesthadeva (c. 1500-1610). Jyesthadeva was also a dis- 
ciple of Damodara but junior to Nilakantha. The direct lineage from Madhava 
continued at least till Acyuta Pisarati (c. 1550-1621), a disciple of Jyesthadeva, 
who wrote many important works and a couple of commentaries in Malayalam 
also. 

At the very beginning of Yuktibhasa, Jyesthadeva states that he intends to
present the rationale of the mathematical and astronomical results and procedures
which are to be found in Tantrasangraha of Nilakantha. Yuktibhasa,
comprising 15 chapters, is naturally divided into two parts, Mathematics and Astronomy.
Topics in astronomy proper, so to say, are taken up for consideration only from the
eighth chapter onwards, starting with a discussion on mean and true planets.
The first seven chapters of Yuktibhasa are in fact in the nature of an
independent treatise on mathematics and deal with various topics which are of
relevance to astronomy. It is here that one finds detailed demonstrations of the
results of Madhava such as the infinite series for π, the arc-tangent, sine and the
cosine functions, the estimation of correction terms and their use in the
generation of faster convergent series. Demonstrations are also provided for the
classical results of Aryabhata (c. 499) on kuttakara (linear indeterminate equations),
of Brahmagupta (c. 628) on the diagonals and the area of a cyclic
quadrilateral, and of Bhaskara II (c. 1150) on the surface area and volume of a sphere.
Many of these rationales have also been presented mostly in the form of
Sanskrit verses by Sankara Variyar (c. 1500-1560) of Trkkutaveli in his
commentaries Kriyakramakari (c. 1535) on Lilavati of Bhaskara II and Yukti-dipika on
Tantrasangraha of Nilakantha. In fact, Sankara Variyar ends his commentary
on the first chapter of Tantrasangraha with the acknowledgement:⁵²


[Sanskrit verse in Devanagari; not legible in this copy.]


Whatever has been the meaning as expounded by the noble dvija of Parakroda
[Jyesthadeva] the same has now been stated by me for the first chapter of
Tantrasangraha.


In the following sections we shall present an overview of the contribution of the
Kerala School to the development of calculus (during the period 1350-1500),
following essentially the exposition given in Yuktibhasa. In order to indicate some
of the concepts and methods developed by the Kerala astronomers, we first take
up the issue of irrationality of π and the summation of infinite geometric series
as discussed by Nilakantha Somayaji in his Aryabhatiya-bhasya. We then
consider the derivation of binomial series expansion and the estimation of the sum of
integral powers of integers, $1^k + 2^k + \ldots + n^k$ for large $n$, as presented in
Yuktibhasa. These results constitute the basis for the derivation of the infinite series
for $\frac{\pi}{4}$ due to Madhava. We shall outline this as also the very interesting work
of Madhava on the estimation of the end-correction terms and the
transformation of the π-series to achieve faster convergence. Finally we shall summarize the
derivation of the infinite series for Rsine and Rcosine due to Madhava.

In the final section, we shall deal with another topic which has a bearing on
calculus, but is not dealt with in Yuktibhasa, namely the evaluation of the
instantaneous velocity of a planet. Here, we shall present the result of Damodara,
as cited by Nilakantha, on the instantaneous velocity of a planet which involves
the derivative of the arc-sine function. There are indeed many works and
commentaries by later astronomers of the Kerala School, whose mathematical
contributions are yet to be studied in detail. We shall here cite only one result due to
Acyuta Pisarati (c. 1550-1621), a disciple of Jyesthadeva, on the instantaneous
velocity of a planet, which involves the evaluation of the derivative of the ratio of
two functions.


52 Tantrasangraha of Nilakantha Somayaji, Ed. with Yukti-dipika of Sankara Variyar by
K. V. Sarma, Hoshiarpur 1977, p. 77. The same acknowledgement appears at the end of the subsequent
chapters also.


9. Nilakantha's discussion of irrationality of π


In the context of discussing the procedure for finding the approximate square root
of a non-square number, by multiplying it by a large square number (the method
given in Trisatika of Sridhara referred to earlier in Section 3.3), Nilakantha
observes in his Aryabhatiya-bhasya:53


[Sanskrit passage in Devanagari script; illegible in this copy.]


Even if we were to proceed this way, the square root obtained will only be ap- 
proximate. The idea [that is being conveyed] is, that it is actually not possible 
to exactly de-limit (paricchedah) the square root of a non-square number. Pre- 
cisely for this reason, multiplication by a large square was stated (recommended) 
in order to get as much accuracy as desired. 


Regarding the choice of the large number that must be made, it is mentioned
that one may choose any number, as large a number as possible, that gives the
desired accuracy.54


[Sanskrit passage in Devanagari script; illegible in this copy.]


You can multiply by whichever large number you want up to your satisfaction 
(buddhavalambhavah). Since largeness is a relative notion, it may be under- 
stood that the process is an unending one. 
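
The procedure described in these passages can be illustrated numerically. The following is a minimal sketch in Python and is not drawn from the text: the function name `approx_sqrt` and the use of powers of ten as the large multiplier are our assumptions, made only for illustration.

```python
from math import isqrt


def approx_sqrt(n: int, scale: int) -> tuple[int, int]:
    """Approximate sqrt(n) as a fraction (numerator, denominator).

    The non-square number n is multiplied by the large square scale**2,
    the integer square root of the product is taken, and the result is
    understood to be divided by scale.
    """
    return isqrt(n * scale * scale), scale


# Illustrative multipliers (an assumption; any large number may be chosen).
for scale in (10, 1_000, 1_000_000):
    num, den = approx_sqrt(2, scale)
    # The integer deficit 2*den^2 - num^2 stays positive however large scale is.
    print(scale, num / den, 2 * den * den - num * num)
```

A larger multiplier gives a closer value, yet the deficit `2 * den * den - num * num` never vanishes for a non-square number, which is the point Nilakantha makes about the process being unending.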


In this context, Nilakantha cites the verse given by Aryabhata specifying the
ratio of the circumference to the diameter of a circle (value of π), particularly
drawing our attention to the fact that Aryabhata refers to this value as
“approximate”.55


53 Aryabhatiya of Aryabhata, Ed. with Aryabhatiya-bhasya of Nilakantha Somayājī by
K. Sambasiva Sastri, Trivandrum Sanskrit Series 101, Trivandrum 1930, comm. on Ganitapada
4, p. 14.

54 Ibid.

55 Ibid.

[Sanskrit passage in Devanagari script; illegible in this copy.]


As will be stated [by the author himself]: ‘this is [only] an approximate measure
of the circumference of the circle whose diameter is twenty-thousand.’ In finding
the circumference from the diameter, a series of inferences are involved. The 
approximate nature of this also stems from the fact that it involves finding square 
roots. All this will be explained later at the appropriate context. 




Addressing the issue later in his commentary, as promised earlier, while
discussing the value of π, Nilakantha observes:56

The relation between the circumference and the diameter has been presented.
... Approximate: This value (62,832) has been stated as only an approximation to
the circumference of a circle having a diameter of 20,000. “Why then has an
approximate value been mentioned here instead of the actual value?” It is explained
[as follows]. Because it (the exact value) cannot be expressed. Why?


Explaining why the exact value cannot be presented, Nilakantha continues:57


[Sanskrit passage in Devanagari script; illegible in this copy.]


Given a certain unit of measurement (mana) in terms of which the diameter
(vyasa) specified [is just an integer and] has no [fractional] part (niravayava),
the same measure when employed to specify the circumference (paridhi) will
certainly have a [fractional] part (savayava) [and cannot be just an integer].
Again if in terms of certain [other] measure the circumference has no [fractional]
part, then employing the same measure the diameter will certainly have a
[fractional] part [and cannot be an integer]. Thus when both [the diameter and the
circumference] are measured by the same unit, they cannot both be specified [as
integers] without [fractional] parts. Even if you go a long way (i.e., keep on
reducing the measure of the unit employed), the fractional part [in specifying one
of them] will only become very small. A situation in which there will be no
[fractional] part (i.e., both the diameter and circumference can be specified in terms of
integers) is impossible, and this is the import [of the expression asanna].


56 Ibid., comm. on Ganitapada 10, p. 41.

57 Ibid., pp. 41-42.


Evidently, what Nilakantha is trying to explain here is the incommensurability
of the circumference and the diameter of a circle. Particularly noteworthy is the
last line of the above quote, where Nilakantha clearly mentions that, however
small you may choose your unit of measurement to be, the two quantities will never
become commensurate.


10. Nilakantha's discussion of the sum of an infinite geometric series 


In his Aryabhatiya-bhasya, while deriving an interesting approximation for the
arc of a circle in terms of the jyā (Rsine) and the sara (Rversine), Nilakantha
presents a detailed demonstration of how to sum an infinite geometric series. The
specific geometric series that arises in this context is:

$$\frac{1}{4} + \left(\frac{1}{4}\right)^2 + \ldots + \left(\frac{1}{4}\right)^n + \ldots = \frac{1}{3}.$$


We shall now present an outline of Nilakantha’s argument that gives an idea 
of how the notion of limit was understood in the Indian mathematical tradition. 


10.1. Nilakantha's approximate formula for the arc in terms of jyā and sara


In Figure 7, AB is the arc whose length (assumed to be small) is to be determined
in terms of the chord lengths AD and BD. In the Indian mathematical literature,
the arc AB, the semi-chord AD and the segment BD are referred to as the capa,
jyardha and sara respectively. As can be easily seen from the figure, this
terminology arises from the fact that these geometrical objects look like a bow, a string
and an arrow respectively. Denoting them by c, j, and s, the expression for the arc
given by Nilakantha may be written as:

$$c \approx \sqrt{\left(1 + \frac{1}{3}\right) s^2 + j^2}. \qquad (74)$$

FIGURE 7. Arc-length in terms of jyā and sara.


Nilakantha's proof of the above equation has been discussed in detail by Sarasvati
Amma.58 It may also be mentioned that the above approximation actually
does not form a part of the text Aryabhatiya; but nevertheless it is introduced by
Nilakantha while commenting upon a verse in Aryabhatiya that gives the arc
in terms of the chords in a circle.59 The verse that succinctly presents the above
equation (74) goes as follows:60


[Sanskrit verse in Devanagari script; illegible in this copy.]

The arc is nearly (prayah) equal to the square root of the sum of the square of
the sara added to one-third of it, and the square of the jyā.
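
As a numerical illustration of (74), the sketch below compares the estimate with the true arc for a few small arcs. It is only a check under stated assumptions: the modern `math.sin` and `math.cos` functions stand in for the jyā and sara, the radius is taken as 1, and the function name `arc_estimate` is ours.

```python
import math


def arc_estimate(jya: float, sara: float) -> float:
    """Arc length from (74): sqrt((1 + 1/3) * sara^2 + jya^2)."""
    return math.sqrt((1 + 1 / 3) * sara**2 + jya**2)


radius = 1.0  # assumed unit radius
for theta in (0.5, 0.25, 0.125):  # arc angles in radians, chosen for illustration
    capa = radius * theta                     # true arc length c
    jya = radius * math.sin(theta)            # semi-chord j
    sara = radius * (1 - math.cos(theta))     # arrow s
    print(theta, capa, arc_estimate(jya, sara), capa - arc_estimate(jya, sara))
```

The printed differences shrink rapidly as the arc becomes smaller, in keeping with the verse's qualification that the arc is only "nearly" equal to the stated expression.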


The proof of (74) given by Nilakantha involves:
(1) Repeated halving of the arc-bit, capa c, to get $c_1, \ldots, c_i$.


(2) Finding the corresponding semi-chords, jyā ($j_i$), and the Rversines,
sara ($s_i$).


(3) Estimating the difference between the capa and jyā at each step.


58 T. A. Sarasvati Amma, cited above (fn. 21), pp. 179-182.

59 [Sanskrit verse in Devanagari script; illegible in this copy.] (Aryabhatiya, Ganitapada, verse 17).

60 Aryabhatiya-bhasya on Aryabhatiya, cited above (fn. 50), comm. on Ganitapada 12 and
17, p. 63 and p. 110. That the verse cited is from another work of his, namely Golasara, has been
alluded to by Nilakantha in both the instances of citation.


If $\delta_i$ denotes the difference between the capa and jyā at the i-th step, that is,

$$\delta_i = c_i - j_i,$$

then it is seen that this difference decreases as the size of the capa decreases.
Having made this observation, Nilakantha proceeds with the argument that


• Generating successive values of the $j_i$-s and $s_i$-s is an ‘unending’ process
(na kvacidapi paryavasyati) as one can keep on dividing the capa into half
ad infinitum (anantyat vibhagasya).


• It would therefore be appropriate to proceed up to a stage where the difference
$\delta_i$ becomes negligible (sunyapraya) and make an ‘intelligent approximation’,
to obtain the value of the difference between c and j approximately.

The original passage in Aryabhatiya-bhasya which presents the above argument
reads as follows:61

[Sanskrit passage in Devanagari script; illegible in this copy.]
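
The behaviour of the difference between the capa and the jyā under repeated halving can be seen in a short computation. This is a sketch only: Nilakantha obtains the semi-chords geometrically, whereas here the modern sine function is assumed, and the function name `capa_jya_differences` and the starting arc of 1 radian are hypothetical choices.

```python
import math


def capa_jya_differences(capa: float, radius: float = 1.0, steps: int = 6) -> list[float]:
    """Return the differences c_i - j_i for successive halvings of the arc."""
    differences = []
    for _ in range(steps):
        capa /= 2                                 # c_i: half of the previous arc
        jya = radius * math.sin(capa / radius)    # j_i: semi-chord of the arc c_i
        differences.append(capa - jya)            # difference between capa and jya
    return differences


for i, delta in enumerate(capa_jya_differences(1.0), start=1):
    print(i, delta)
```

Each difference is positive and smaller than the one before it, so the halving may be stopped at whatever stage the difference is judged negligible.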


10.2. Nilakantha’s summation of the infinite geometric series 


The question that Nilakantha poses as he commences his detailed discussion on
the sum of geometric series is very important and arises quite naturally whenever
one encounters the sum of an infinite series:62


[Sanskrit passage in Devanagari script; illegible in this copy.]

How do you know that [the sum of the series] increases only up to that [limiting
value] and that it certainly increases up to that [limiting value]?


Proceeding to answer the above question, Nilakantha first states the general
result

$$a\left[\left(\frac{1}{r}\right) + \left(\frac{1}{r}\right)^2 + \left(\frac{1}{r}\right)^3 + \ldots\right] = \frac{a}{r-1}.$$

Here, the left hand side is an infinite geometric series with the successive terms
being obtained by dividing by a common divisor, r, known as cheda, whose value
is assumed to be greater than 1. He further notes that this result is best
demonstrated by considering a particular case, say r = 4. In his own words:63


[Sanskrit passage in Devanagari script; illegible in this copy.]


It is being explained. Thus, in an infinite (ananta) geometrical series
(tulyaccheda-parabhaga-parampara) the sum of all the infinite number of terms
considered will always be equal to the value obtained by dividing by a factor which is
one less than the common factor of the series. That this is so will be demonstrated
by first considering the series obtained with one-fourth (caturamsa-parampara).


61 Ibid., comm. on Ganitapada 17, pp. 104-05.

62 Ibid., p. 106.


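What is intended to be demonstrated is

$$a\left[\left(\frac{1}{4}\right) + \left(\frac{1}{4}\right)^2 + \ldots + \left(\frac{1}{4}\right)^n + \ldots\right] = \frac{a}{3}.$$

Besides the multiplying factor a, it is noted that one-fourth and one-third are
the only terms appearing in the above equation. Nilakantha first defines these
numbers in terms of one-twelfth of the multiplier a referred to by the word rasi.
For the sake of simplicity we take the rasi to be unity.

$$\frac{1}{4} = \frac{3}{12}; \qquad \frac{1}{3} = \frac{4}{12}. \qquad (75)$$

Having defined them, Nilakantha first obtains the sequence of results,

$$\frac{1}{3} = \frac{1}{4} + \frac{1}{(4 \cdot 3)},$$

$$\frac{1}{(4 \cdot 3)} = \frac{1}{(4 \cdot 4)} + \frac{1}{(4 \cdot 4 \cdot 3)},$$

$$\frac{1}{(4 \cdot 4 \cdot 3)} = \frac{1}{(4 \cdot 4 \cdot 4)} + \frac{1}{(4 \cdot 4 \cdot 4 \cdot 3)},$$

and so on, which leads to the general result,

$$\frac{1}{3} - \left[\frac{1}{4} + \left(\frac{1}{4}\right)^2 + \ldots + \left(\frac{1}{4}\right)^n\right] = \left(\frac{1}{4}\right)^n \left(\frac{1}{3}\right). \qquad (76)$$


63 Ibid., pp. 106-07.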

Nilakantha then goes on to present the following crucial argument to derive the
sum of the infinite geometric series: As we sum more terms, the difference
between $\frac{1}{3}$ and sum of powers of $\frac{1}{4}$ (as given by the right hand side of (76)), becomes
extremely small, but never zero. Only when we take all the terms of the infinite
series together do we obtain the equality

$$\frac{1}{4} + \left(\frac{1}{4}\right)^2 + \ldots + \left(\frac{1}{4}\right)^n + \ldots = \frac{1}{3}. \qquad (77)$$


A brief extract from the text presenting the above argument is given below:64 


[Sanskrit passage in Devanagari script; illegible in this copy.]


Three times one-twelfth of a rasi is one-fourth (caturamsa) [of that rasi]. Four
times that is one-third (tryamsa). [Considering] four times that [one-twelfth of
the rasi] which is one-third, three by fourth of that falls short by one-fourth [of
one-third of the rasi]. Three-fourths of that [i.e., of $\frac{1}{12}$ of the rasi] which is
one-fourth of that (tryamsa), again falls short [of the same] by one-fourth of
one-fourth [of one-third of the rasi] ...


Since the result to be demonstrated or the process to be carried out is never
ending (anantyat) and the difference, though very small (atisuksmatvat), [still
exists, the sum of the series] cannot be simply taken to be one-third. It seems
that the process is incomplete since always something remains because of its never 
ending nature. In fact, since in all the problems involving [infinite] series, by 
bringing in all the terms and placing them together, the process would [in princi- 
ple] become complete, here, in the mathematics involving repeated multiplication 
of one-fourth, a similar conclusion may be drawn. 


64 Ibid., p. 107.
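
The relation (76) and its analogue for a general common divisor r can be checked with exact rational arithmetic. The following is a minimal sketch; the function name `partial_sum` and the sample values of n are our assumptions, and the standard-library `fractions.Fraction` type is used so that no rounding intervenes.

```python
from fractions import Fraction


def partial_sum(n: int, r: int = 4) -> Fraction:
    """Sum of the first n terms (1/r) + (1/r)^2 + ... + (1/r)^n."""
    return sum(Fraction(1, r) ** k for k in range(1, n + 1))


r = 4
limit = Fraction(1, r - 1)  # the value 1/(r - 1), i.e. one-third for r = 4
for n in (1, 2, 5, 10):
    remainder = limit - partial_sum(n, r)
    # The remainder equals (1/r)^n * 1/(r - 1): very small, but never zero.
    assert remainder == Fraction(1, r) ** n * limit
    print(n, remainder)
```

For every finite n the remainder is a positive fraction, which is the sense in which the equality holds only when all the terms of the series are taken together.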

11. Derivation of binomial series expansion


Yuktibhasa presents a very interesting derivation of the binomial series for
$(1 + x)^{-1}$ by making iterative substitutions in an algebraic identity. The method given
in the text may be summarized as follows.65

Consider the product $a\left(\frac{c}{b}\right)$, where some quantity a is multiplied by the
multiplier c, and divided by the divisor b. Here, a is called gunya, c the gunaka and
b the hara, which are all assumed to be positive. Now the above product can be
rewritten as:

$$a\left(\frac{c}{b}\right) = a - a\,\frac{(b-c)}{b}. \qquad (78)$$

In the expression $a\,\frac{(b-c)}{b}$ in (78) above, if we want to replace the division by b
(the divisor) by division by c (the multiplier), then we have to make a subtractive
correction (called sodhya-phala) which amounts to the following equation.

$$a\,\frac{(b-c)}{b} = a\,\frac{(b-c)}{c} - \left(a\,\frac{(b-c)}{c} \times \frac{(b-c)}{b}\right). \qquad (79)$$
b c c b 


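Now, in the second term (inside parenthesis) in (79), which is what we referred
to as sodhya-phala and which literally means a quantity to be subtracted, if we again
replace the division by b by division by c, then we have to employ the relation (79)
once again to get another subtractive term

$$\begin{aligned}
a\,\frac{c}{b} &= a - \left[a\,\frac{(b-c)}{c} - \left(a\,\frac{(b-c)}{c} \times \frac{(b-c)}{b}\right)\right] \\
&= a - a\,\frac{(b-c)}{c} + \left[a\,\frac{(b-c)}{c} \times \frac{(b-c)}{c} - \left(a\,\frac{(b-c)}{c} \times \frac{(b-c)}{c} \times \frac{(b-c)}{b}\right)\right] \\
&= a - a\,\frac{(b-c)}{c} + \left[a\,\frac{(b-c)^2}{c^2} - \left(a\,\frac{(b-c)^2}{c^2} \times \frac{(b-c)}{b}\right)\right]. \qquad (80)
\end{aligned}$$

Here, the quantity $a\,\frac{(b-c)^2}{c^2}$ is called dvitiya-phala or simply dvitiya and the
one subtracted from that is dvitiya-sodhya-phala. If we carry out the same set of
operations, the m-th sodhya-phala subtracted from the m-th term will be of the form

$$a\left[\frac{(b-c)}{c}\right]^m - a\left[\frac{(b-c)}{c}\right]^m \times \frac{(b-c)}{b}.$$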
65 Ganita-yukti-bhasa, cited above, Vol. I, Sections 6.3.34, pp. 54-58, 188-191, 375-378. 

Since the successive sodhya-phalas are subtracted from their immediately
preceding term, we will end up with a series in which all the odd terms (leaving
out the gunya, a) are negative and the even ones positive. Thus, after taking m
sodhya-phalas we get

$$\begin{aligned}
a\,\frac{c}{b} = {} & a - a\left[\frac{(b-c)}{c}\right] + a\left[\frac{(b-c)}{c}\right]^2 - \ldots + (-1)^m a\left[\frac{(b-c)}{c}\right]^m \\
& + (-1)^{m+1} a\left[\frac{(b-c)}{c}\right]^m \frac{(b-c)}{b}. \qquad (81)
\end{aligned}$$


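Regarding the question of termination of the process, both the texts Yuktibhasa
and Kriyakramakari clearly mention that logically there is no end to the process
of generating sodhya-phalas. We may thus write our result as:66

$$\begin{aligned}
a\,\frac{c}{b} = {} & a - a\left[\frac{(b-c)}{c}\right] + a\left[\frac{(b-c)}{c}\right]^2 - \ldots + (-1)^{m-1} a\left[\frac{(b-c)}{c}\right]^{m-1} \\
& + (-1)^m a\left[\frac{(b-c)}{c}\right]^m + \ldots. \qquad (82)
\end{aligned}$$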

It is also noted that the process may be terminated after having obtained the
desired accuracy by neglecting the subsequent phalas as their magnitudes become
smaller and smaller. In fact, Kriyakramakari explicitly mentions the condition
under which the succeeding phalas will become smaller and smaller:67


[Sanskrit passage in Devanagari script; illegible in this copy.]

Thus, even if we keep finding the phalas repeatedly, logically there is no end
to the process. Even then, having carried on the process to the desired accuracy
(yavadapeksam suksmatamapadya), one should terminate computing
the phalas by [simply] neglecting the terms that may be obtained further
(pascatyanyupeksya). Here, the succeeding phalas will become smaller and smaller
only when the difference between the gunaka and hara is smaller than gunaka
[that is (b ~ c) < c].


66 It may be noted that if we set $\frac{(b-c)}{c} = x$, then $\frac{c}{b} = \frac{1}{(1+x)}$. Hence, the series (82) is none other
than the well known binomial series

$$\frac{a}{1+x} = a - ax + ax^2 - \ldots + (-1)^m a x^m + \ldots,$$

which is convergent for −1 < x < 1.

67 Kriyakramakari on Lilavati, cited above (fn. 14), comm. on verse 199, p. 385.
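
The iterative substitution can be verified with exact fractions. The sketch below is an illustration only: the function names `sodhya_expansion` and `truncated_series` and the sample values a = 7, b = 5, c = 4 are hypothetical. The first function evaluates the expansion reached after m sodhya-phalas, including the final term that still carries the divisor b, as in (81); the second drops that final term, as in the series (82) cut off after m + 1 terms.

```python
from fractions import Fraction


def sodhya_expansion(a: int, b: int, c: int, m: int) -> Fraction:
    """Expansion of a*c/b after m sodhya-phalas, final corrective term included."""
    r = Fraction(b - c, c)
    total = sum((-1) ** k * a * r**k for k in range(m + 1))
    total += (-1) ** (m + 1) * a * r**m * Fraction(b - c, b)
    return total


def truncated_series(a: int, b: int, c: int, m: int) -> Fraction:
    """The same expansion with the final corrective term neglected."""
    r = Fraction(b - c, c)
    return sum((-1) ** k * a * r**k for k in range(m + 1))


a, b, c = 7, 5, 4  # sample values with (b - c) < c
for m in (1, 3, 10):
    # With the corrective term the expansion is exact at every stage.
    assert sodhya_expansion(a, b, c, m) == Fraction(a * c, b)
    print(m, float(truncated_series(a, b, c, m)), float(Fraction(a * c, b)))
```

With the corrective term retained the identity is exact for every m; once it is neglected, the truncated value approaches a·c/b only because (b − c) < c in the sample values, in line with the condition stated in Kriyakramakari.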

12. Estimation of sums of $1^k + 2^k + \ldots + n^k$ for large n


As mentioned in section 4.1, Aryabhata has given the explicit formulae for the
summation of squares and cubes of integers. The word employed in the Indian
mathematical literature for summation is sankalita. The formulae given by
Aryabhata for the sankalitas are as follows:

$$S_n^{(1)} = 1 + 2 + \ldots + n = \frac{n(n+1)}{2},$$

$$S_n^{(2)} = 1^2 + 2^2 + \ldots + n^2 = \frac{n(n+1)(2n+1)}{6},$$

$$S_n^{(3)} = 1^3 + 2^3 + \ldots + n^3 = \left[\frac{n(n+1)}{2}\right]^2. \qquad (83)$$

From these, it is easy to estimate these sums when n is large. Yuktibhasa gives a
general method of estimating the sama-ghata-sankalita

$$S_n^{(k)} = 1^k + 2^k + \ldots + n^k, \qquad (84)$$

when n is large. The text presents a general method of estimation, which does not
make use of the actual value of the sum. In fact, the same argument is repeated
even for k = 1, 2, 3, although the result of summation is well known in these
cases.


12.1. The sum of natural numbers (Mula-sankalita)


Yuktibhasa takes up the discussion on sankalitas in the context of evaluating the
circumference of a circle which is conceived to be inscribed in a square. It is
half the side of this square that is being referred to by the word bhuja in both
the citations as well as explanations offered below. Half of the side of the square
(equal to the radius) is divided into n equal bits, known as bhuja-khandas. It is
these bhuja-khandas $\left(\frac{r}{n}\right), 2\left(\frac{r}{n}\right), \ldots$ whose powers are summed.

To start with, Yuktibhasa discusses just the basic summation of bhuja-khandas
called mula-sankalita. We now cite the following from the translation of
Yuktibhasa:68

Now is described the methods of making the summations (referred to in the earlier
sections). At first, the simple arithmetical progression (kevala-sankalita) is
described. This is followed by the summation of the products of equal numbers
(squares). ...

Here, in this mula-sankalita (basic arithmetical progression), the final bhuja is
equal to the radius. The term before that will be one segment (khanda) less. The
next one will be two segments less. Here, if all the terms (bhujas) had been equal
to the radius, the result of the summation would be obtained by multiplying the
radius by the number of bhujas. However, here, only one bhuja is equal to the
radius. And, from that bhuja, those associated with the smaller hypotenuses are
less by one segment each, in order. Now, suppose the radius to be of the same
number of units as the number of segments to which it has been divided, in order
to facilitate remembering (their number). Then, the number associated with the
penultimate bhuja will be less by one (from the number of units in the radius);
the number of the next one, will be less by two from the number of units in the
radius. This reduction (in the number of segments) will increase by one (at each
step). The last reduction will practically be equal to the measure of the radius, for
it will be less only by one segment. In other words, when the reductions are all
added, the sum thereof will practically (prayena) be equal to the summation of
the series from 1 to the number of units in the radius; it will be less only by one
radius length. Hence, the summation will be equal to the product of the number
of units in the radius with the number of segments plus one, and divided by 2.
The summation of all the bhujas of the different hypotenuses is called
bhuja-sankalita.

Now, the smaller the segments, the more accurate (sūksma) will be the result.
Hence, do the summation also by taking each segment as small as an atom (anu).
Here, if it (namely, the bhuja or the radius) is divided into parardha (a very
large number) parts, to the bhuja obtained by multiplying by parardha add one
part in parardha and multiply by the radius and divide by 2, and then divide by
parardha. For, the result will practically be the square of the radius divided by
two. ...


68 Ganita-yukti-bhasa, cited above, Section 6.4, pp. 61-67, 192-97, 382-88.


The first summation, the bhuja-sankalita, may be written in the reverse order
from the final bhuja to the first bhuja as

$$S_n^{(1)} = \left(\frac{nr}{n}\right) + \left(\frac{(n-1)r}{n}\right) + \ldots + \left(\frac{r}{n}\right). \qquad (85)$$

Now, conceive of the bhuja-khanda $\frac{r}{n}$ as being infinitesimal (anu) and at the same
time as of unit-measure (rupa), so that the radius will be the measure of n, the
pada, or the number of terms. Then

$$S_n^{(1)} = n + (n-1) + \ldots + 1. \qquad (86)$$

If each of the terms were of the measure of radius (n) then the sum would be
nothing but $n^2$, the square of the radius. But only the first term is of the measure
of radius, the next is deficient by one segment (khanda), the next by two segments
and so on till the last term which is deficient by an amount equal to radius-minus-one
segment. In other words,

$$\begin{aligned}
S_n^{(1)} &= n + [n-1] + [n-2] + \ldots + [n-(n-2)] + [n-(n-1)] \\
&= n \cdot n - [1 + 2 + \ldots + (n-1)]. \qquad (87)
\end{aligned}$$

When n is very large, the quantity to be subtracted from $n^2$ is practically (prayena)
the same as $S_n^{(1)}$, thus leading to the estimate

$$S_n^{(1)} \approx n^2 - S_n^{(1)}, \qquad (88)$$

or

$$S_n^{(1)} \approx \frac{n^2}{2}. \qquad (89)$$

It is stated that the result is more accurate when the size of the segments is small
(or equivalently, the value of n is large).69

If instead of making the approximation as in (88), we proceed with (87) as it is,
we get $S_n^{(1)} = n^2 - (S_n^{(1)} - n)$, which leads to the well-known exact value of the
sum of the first n natural numbers

$$S_n^{(1)} = \frac{n(n+1)}{2}. \qquad (90)$$

With the convention that $\frac{r}{n}$ is of unit-measure, the above estimate (89) is stated
in the form that the bhuja-sankalita is half the square of the radius.
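
The argument of this subsection can be followed numerically. The sketch below, with hypothetical function names `mula_sankalita` and `via_deficits`, computes the sum directly, recomputes it as the square of n less the accumulated deficits as in (87), and compares the exact value with the estimate (89) for increasing n.

```python
def mula_sankalita(n: int) -> int:
    """Direct sum n + (n - 1) + ... + 1."""
    return sum(range(1, n + 1))


def via_deficits(n: int) -> int:
    """n*n minus the deficits 1 + 2 + ... + (n - 1), following (87)."""
    return n * n - sum(range(1, n))


for n in (10, 1_000, 100_000):
    exact = mula_sankalita(n)
    assert exact == via_deficits(n) == n * (n + 1) // 2
    # Ratio of the exact sum to the estimate n^2 / 2; it tends to 1 as n grows.
    print(n, exact, exact / (n * n / 2))
```

The ratio exceeds 1 by exactly 1/n, so the finer the division of the radius, the closer the estimate.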


12.2. Summation of squares (Varga-sankalita)


We now cite the following from the translation of Yuktibhasa:70

Now is explained the summation of squares (varga-sankalita). Obviously, the
squares of the bhujas, which are summed up above, are the bhujas each multiplied
by itself. Here, if the bhujas which are all multipliers, had all been equal to
the radius, their sum, (sankalita derived above), multiplied by the radius would
have been the summation of their squares. Here, however, only one multiplier
happens to be equal to the radius, and that is the last one. The one before that will
have the number of segments one less than in the radius. (Hence) if that, (i.e., the
second one), is multiplied by the radius, it would mean that one multiplied by the
penultimate bhuja would have been the increase in the summation of the squares.
Then (the segment) next below is the third. That will be less than the radius by
two segments. If that is multiplied by the radius, it will mean that, the summation
of the squares will increase by the product of the bhuja by two (segments).
In this manner, the summation in which the multiplication is done by the radius
(instead of the bhujas) would be larger than the summation of squares by terms
which involve the successively smaller bhujas multiplied by successively higher
numbers. If (all these additions) are duly subtracted from the summation where
the radius is used as the multiplier, the summation of squares (varga-sankalita)
will result.

Now, the bhuja next to the east-west line is less than the radius by one (segment).
So if all the excesses are summed up and added, it would be the summation of the
basic summation (mula-sankalita-sankalita). Because, the sums of the summations
is verily the ‘summation of summations’ (sankalita-sankalita). There,
the last sum has (the summation of) all the bhujas. The penultimate sum is next
lower summation to the last. This penultimate sum is the summation of all the
bhujas except the last bhuja. Next to it is the third sum which is the sum of all
the bhujas except the last two. Thus, each sum of the bhujas commencing from
any bhuja which is taken to be the last one in the series, will be less by one bhuja
from the sum (of the bhujas) before that.

Thus, the longest bhuja is included only in one sum. But the bhuja next lower
than the last (bhuja) is included both in the last sum and also in the next lower
sum. The bhujas below that are included in the three, four etc. sums below it.
Hence, it would result that the successively smaller bhujas commencing from the
one next to the last, which have been multiplied by numbers commencing from 1
and added together, would be summation of summations (sankalita-sankalita).
Now, it has been stated earlier that the summation (sankalita) of (the segments
constituting) a bhuja which has been very minutely divided, will be equal to
half the square of the last bhuja. Hence, it follows that, in order to obtain the
summation (sankalita) of the bhujas ending in any particular bhuja, we will
have to square each of the bhujas and halve it. Thus, the summation of summations
(sankalita-sankalita) would be half the summation of the squares of all the
bhujas. In other words, half the summation of the squares is the summation of the
basic summation. So, when the summation is multiplied by the radius, it would
be one and a half times the summation of the squares. This fact can be expressed
by stating that this contains half more of the summation of squares. Therefore,
when the square of the radius divided by two is multiplied by the radius and one-third
of it subtracted from it, the remainder will be one-third of the whole. Thus it
follows that one-third of the cube of the radius will be the summation of squares
(varga-sankalita).


69 Sankara Variyar also emphasizes the same idea, in his discussion of the estimation of
sankalitas in his commentary Kriyakramakari on Lilavati (cited above (fn. 14), comm. on verse
199, p. 382):

[Sanskrit passage in Devanagari script; illegible in this copy.]

Only when the segment is small (khandasyalpatve) the result obtained would be
accurate.

70 Ganita-yukti-bhasa, cited above, Section 6.4, pp. 61-67, 192-97, 382-88.


With the same convention that $\frac{r}{n}$ is the measure of the unit, the
bhuja-varga-sankalita (the sum of the squares of the bhujas) will be

$$S_n^{(2)} = n^2 + (n-1)^2 + \ldots + 1^2. \qquad (91)$$

In the above expression, each bhuja is multiplied by itself. If instead, we consider that
each bhuja is multiplied by the radius (n in our units), then that would give rise
to the sum

$$n\,[n + (n-1) + \ldots + 1] = n\,S_n^{(1)}. \qquad (92)$$

This sum exceeds the bhuja-varga-sankalita by the amount

$$n S_n^{(1)} - S_n^{(2)} = 1 \cdot (n-1) + 2 \cdot (n-2) + 3 \cdot (n-3) + \ldots + (n-1) \cdot 1.$$

This may be written in the form

$$\begin{aligned}
n S_n^{(1)} - S_n^{(2)} = {} & (n-1) + (n-2) + (n-3) + \ldots + 1 \\
& + (n-2) + (n-3) + \ldots + 1 \\
& + (n-3) + \ldots + 1 \\
& + \ldots. \qquad (93)
\end{aligned}$$

Thus,

$$n S_n^{(1)} - S_n^{(2)} = S_{n-1}^{(1)} + S_{n-2}^{(1)} + S_{n-3}^{(1)} + \ldots. \qquad (94)$$

The right hand side of (94) is called the sankalita-sankalita (or sankalitaikya),
the repeated sum of the sums $S_i^{(1)}$ (here taken in the order i = n−1, n−2, ..., 1).
These are defined also by Sankara Variyar in Kriyakramakari as follows:71


The sum of the summations is called sankalita-sankalita. Of them the last
sankalita is the sum of all the bhujas. The penultimate sankalita is the sum of
all the bhujas other than the last one. The sankalita of the one preceding the
penultimate is the sum of the bhujas ending with that. Thus, all the preceding
sankalitas will fall short by a bhuja from the succeeding sankalita.


For large n, we have already estimated in (89) that $S_n^{(1)} \approx \frac{n^2}{2}$. Thus, for large n

$$n S_n^{(1)} - S_n^{(2)} \approx \frac{(n-1)^2}{2} + \frac{(n-2)^2}{2} + \frac{(n-3)^2}{2} + \ldots. \qquad (95)$$

Thus, the right hand side of (94) (the sankalita-sankalita or the excess of $n S_n^{(1)}$
over $S_n^{(2)}$) is essentially $\frac{S_n^{(2)}}{2}$ for large n, so that we obtain

$$n S_n^{(1)} - S_n^{(2)} \approx \frac{S_n^{(2)}}{2}. \qquad (96)$$

Again, using the earlier estimate (89) for $S_n^{(1)}$, we obtain the result

$$S_n^{(2)} \approx \frac{n^3}{3}. \qquad (97)$$

Thus bhuja-varga-sankalita is one-third the cube of the radius.


71 Kriyakramakari on Lilavati, cited above (fn. 14), comm. on verse 199, pp. 382-83.
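
The two steps of this argument can be checked directly: that the excess of n times the sum of the bhujas over the sum of their squares equals the summation of summations, as in (94), and that the sum of squares approaches one-third of the cube of n. The sketch below is illustrative; the function names `power_sum`, `excess` and `sankalita_sankalita` are ours.

```python
def power_sum(n: int, k: int) -> int:
    """1^k + 2^k + ... + n^k."""
    return sum(i**k for i in range(1, n + 1))


def excess(n: int) -> int:
    """Excess of n times the sum of 1..n over the sum of squares."""
    return n * power_sum(n, 1) - power_sum(n, 2)


def sankalita_sankalita(n: int) -> int:
    """Sum of the sums 1..i for i = n-1, n-2, ..., 1."""
    return sum(power_sum(i, 1) for i in range(1, n))


for n in (4, 50, 1_000):
    assert excess(n) == sankalita_sankalita(n)   # the relation (94) is exact
    # Ratio of the exact sum of squares to the estimate n^3 / 3.
    print(n, power_sum(n, 2), power_sum(n, 2) / (n**3 / 3))
```

The identity holds exactly for every n, while the ratio in the last column approaches 1 only as n grows, which separates the exact step of the argument from the approximate one.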


12.3. Sama-ghata-sankalita 


We now cite the following from the translation of Yuktibhasa:72


Now, the square of the square (of a number) is multiplied by itself, it is called
sama-panca-ghata (number multiplied by itself five times). The successive
higher order summations are called sama-pancadi-ghata-sankalita (and will
be the summations of powers of five and above). Among them if the summation
(sankalita) of powers of some order is multiplied by the radius, then the product
is the summation of summations (sankalita-sankalita) of the (powers of
the) multiplicand (of the given order), together with the summation of powers
(sama-ghata-sankalita) of the next order. Hence, to derive the summation of
the successive higher powers: Multiply each summation by the radius. Divide it
by the next higher number and subtract the result from the summation got before.
The result will be the required summation to the higher order.


Thus, divide by two the square of the radius. If it is the cube of the radius, divide
by three. If it is the radius raised to the power of four, divide by four. If it is
(the radius) raised to the power of five, divide by five. In this manner, for powers
rising one by one, divide by numbers increasing one by one. The results will be,
in order, the summations of powers of numbers (sama-ghata-sankalita). Here,
the basic summation is obtained from the square, the summation of squares from
the cube, the summation of cubes from the square of the square. In this manner,
if the numbers are multiplied by themselves a certain number of times (i.e., raised
to a certain degree) and divided by the same number, that will be the summation
of the order one below that. Thus (has been stated) the method of deriving the
summations of (natural) numbers, (their) squares etc.


72 Ganita-yukti-bhasa, cited above, Section 6.4, pp. 61-67, 192-97, 382-88.


In the case of a general samaghata-sankalita (summation of equal powers), given 
by 

$$S_n^{(k)} = n^k + (n-1)^k + \ldots + 1^k, \qquad (98)$$

the procedure followed to estimate its behavior for large n is essentially the same 
as that followed in the case of vargasankalita. We first compute the excess of 
$n S_n^{(k-1)}$ over $S_n^{(k)}$ to be a sankalita-sankalita or repeated sum of the lower order 
sankalitas $S_r^{(k-1)}$:

$$n S_n^{(k-1)} - S_n^{(k)} = S_{n-1}^{(k-1)} + S_{n-2}^{(k-1)} + S_{n-3}^{(k-1)} + \ldots \qquad (99)$$

If the lower order sankalita $S_n^{(k-1)}$ has already been estimated to be, say, 

$$S_n^{(k-1)} \approx \frac{n^k}{k}, \qquad (100)$$

then the above relation (99) leads to$^{73}$ 

$$n S_n^{(k-1)} - S_n^{(k)} \approx \frac{(n-1)^k}{k} + \frac{(n-2)^k}{k} + \frac{(n-3)^k}{k} + \ldots \approx \left(\frac{1}{k}\right) S_n^{(k)}. \qquad (101)$$

Rewriting the above equation we have$^{74}$ 

$$S_n^{(k)} \approx n S_n^{(k-1)} - \left(\frac{1}{k}\right) S_n^{(k)}. \qquad (102)$$

Thus we obtain the estimate 

$$S_n^{(k)} \approx \frac{n^{k+1}}{k+1}. \qquad (103)$$
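The relation (99) and the large-n estimate (103) can be checked numerically. The following is a minimal Python sketch; the function names are illustrative choices made here and do not come from the text. It verifies that the excess of n S_n^(k-1) over S_n^(k) is exactly the repeated sum of the lower-order sankalitas, and shows how closely n^(k+1)/(k+1) approximates the exact sum of powers.

```python
def power_sum(n: int, k: int) -> int:
    """Exact samaghata-sankalita S_n^(k) = 1^k + 2^k + ... + n^k."""
    return sum(j ** k for j in range(1, n + 1))


def large_n_estimate(n: int, k: int) -> float:
    """Leading-order estimate n^(k+1)/(k+1), as in (103)."""
    return n ** (k + 1) / (k + 1)


def excess(n: int, k: int) -> int:
    """Left side of (99): n * S_n^(k-1) - S_n^(k)."""
    return n * power_sum(n, k - 1) - power_sum(n, k)


def repeated_lower_sum(n: int, k: int) -> int:
    """Right side of (99): S_{n-1}^(k-1) + S_{n-2}^(k-1) + ... + S_1^(k-1)."""
    return sum(power_sum(r, k - 1) for r in range(1, n))


if __name__ == "__main__":
    n = 1000
    for k in range(1, 6):
        ratio = power_sum(n, k) / large_n_estimate(n, k)
        print(f"k = {k}: exact / estimate = {ratio:.6f}")
```

The printed ratios approach 1 as n grows, which is the sense in which (103) is an estimate for large n rather than an exact formula.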


73 As one of the reviewers has pointed out, this argument leading to (101) is indeed similar to the 
derivation of the following relation, which is based on the interchange of order in iterated integrals: 

$$\int_0^1 (1-x)\,x^{k-1}\,dx = \int_0^1 \left[\int_x^1 dy\right] x^{k-1}\,dx = \int_0^1 \int_0^y x^{k-1}\,dx\,dy = \int_0^1 \frac{y^k}{k}\,dy.$$

74 As Sankara Variyar states in his Kriyakramakari on Lilavati (cited above (fn. 14), p. 383): 

[Sanskrit verse in Devanagari; illegible in this copy]

Therefore it is established that, for obtaining the sum of the next order, the previous 
sum has to be multiplied by the radius and the present sum, divided by one 
more than the previous [order], has to be diminished [from that product]. 


12.4. Repeated summations (Sankalita-sankalita) 


After having estimated the sum of powers of natural numbers (samaghata-sankalita), 
Yuktibhasa goes on to derive an estimate for the repeated summation (sankalita-sankalita 
or sankalitaikya or varasankalita) of the natural numbers 1, 2, ..., n.$^{75}$ 


Now, are explained the first, second and further summations: The first summation 
(adya-sankalita) is the basic summation (mula-sankalita) itself. It has 
already been stated (that this is) half the product of the square of the number of 
terms (pada-vargardha). The second (dvitiya-sankalita) is the summation of 
the basic summation (mula-sankalitaikya). It has been stated earlier that it is 
equal to half the summation of squares. And that will be one-sixth of the cube of 
the number of terms. 

Now, the third summation: For this, take the second summation as the last term 
(antya); subtract one from the number of terms, and calculate the summation 
of summations as before. Treat this as the penultimate. Then subtract two from 
the number of terms and calculate the summation of summations. That will be the 
next lower term. In order to calculate the summation of summations of numbers in 
the descending order, the sums of one-sixths of the cubes of numbers in descend- 
ing order would have to be calculated. That will be the summation of one-sixth 
of the cubes. And that will be one-sixth of the summation of cubes. As has been 
enunciated earlier, the summation of cubes is one-fourth the square of the square. 
Hence, one-sixth of one-fourth the square of the square will be the summation 
of one-sixth of the cubes. Hence, one-twenty-fourth of the square of the square 
will be the summation of one-sixth of the cubes. Then, the fourth summation will 
be, according to the above principle, the summation of one-twenty-fourths of the 
square of squares. This will also be equal to one-twenty-fourth of one-fifth of 
the fifth power. Hence, when the number of terms has been multiplied by itself 
a certain number of times, (i.e., raised to a certain degree), and divided by the 
product of one, two, three etc. up to that index number, the result will be the 
summation up to that index number amongst the first, second etc. summations 
(adya-dvitiyadi-sankalita). 


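The first summation (adya-sankalita) $V_n^{(1)}$ is just the mula-sankalita or the 
basic summation of natural numbers, which has already been estimated in (89): 

$$V_n^{(1)} = S_n^{(1)} = n + (n-1) + (n-2) + \ldots + 1 \approx \frac{n^2}{2}. \qquad (104)$$

The second summation (dvitiya-sankalita or sankalita-sankalita or sankalitaikya) 
is given by 

$$\begin{aligned} V_n^{(2)} &= V_n^{(1)} + V_{n-1}^{(1)} + V_{n-2}^{(1)} + \ldots \\ &= S_n^{(1)} + S_{n-1}^{(1)} + S_{n-2}^{(1)} + \ldots \end{aligned} \qquad (105)$$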


75 Ganita-yukti-bhasa, cited above, Section 6.4, pp. 61-67, 192-97, 382-88. 
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As was done earlier, this second summation can be estimated using the estimate 
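(89) for $S_n^{(1)}$: 

$$V_n^{(2)} \approx \frac{n^2}{2} + \frac{(n-1)^2}{2} + \frac{(n-2)^2}{2} + \ldots \qquad (106)$$

Therefore 

$$V_n^{(2)} \approx \left(\frac{1}{2}\right) S_n^{(2)}. \qquad (107)$$

Using the earlier estimate (97) for $S_n^{(2)}$, we get an estimate for the dvitiya-sankalita: 

$$V_n^{(2)} \approx \frac{n^3}{6}. \qquad (108)$$

Now the next repeated summation can be found in the same way: 

$$\begin{aligned} V_n^{(3)} &= V_n^{(2)} + V_{n-1}^{(2)} + V_{n-2}^{(2)} + \ldots \\ &\approx \frac{n^3}{6} + \frac{(n-1)^3}{6} + \frac{(n-2)^3}{6} + \ldots \\ &\approx \left(\frac{1}{6}\right) S_n^{(3)} \approx \frac{n^4}{24}. \end{aligned} \qquad (109)$$

It is noted that proceeding this way we can estimate the repeated summation $V_n^{(k)}$ of 
order k, for large n, to be$^{76}$ 

$$\begin{aligned} V_n^{(k)} &= V_n^{(k-1)} + V_{n-1}^{(k-1)} + V_{n-2}^{(k-1)} + \ldots \\ &\approx \frac{n^{k+1}}{1 \cdot 2 \cdot 3 \cdots (k+1)}. \end{aligned} \qquad (110)$$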
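The repeated summations can also be generated directly by iterated running sums and compared with the large-n estimate (110). The sketch below is illustrative only; the function names are assumptions made here. The exact closed form used for comparison is the standard identity V_n^(k) = n(n+1)...(n+k)/(1·2·...·(k+1)), written as a binomial coefficient, which is the kind of exact expression the accompanying footnote attributes to earlier texts.

```python
from itertools import accumulate
from math import comb, factorial


def repeated_sum(n: int, k: int) -> int:
    """k-th repeated summation V_n^(k) of 1, 2, ..., n via iterated running sums."""
    values = list(range(1, n + 1))      # the natural numbers themselves
    for _ in range(k):
        values = list(accumulate(values))   # each pass raises the order by one
    return values[-1]


def exact_closed_form(n: int, k: int) -> int:
    """Standard identity: V_n^(k) = C(n + k, k + 1)."""
    return comb(n + k, k + 1)


def large_n_estimate(n: int, k: int) -> float:
    """Estimate n^(k+1) / (1*2*...*(k+1)), as in (110)."""
    return n ** (k + 1) / factorial(k + 1)


if __name__ == "__main__":
    n = 2000
    for k in range(1, 5):
        v = repeated_sum(n, k)
        print(k, v == exact_closed_form(n, k), v / large_n_estimate(n, k))
```

For k = 1 and k = 2 the closed form reduces to n(n+1)/2 and n(n+1)(n+2)/6, and the ratio to the estimate tends to 1 as n grows.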


76 These are again estimates for large n. As mentioned in Section 4, exact expressions for the first 
two summations, $V_n^{(1)}$ and $V_n^{(2)}$, are given in Aryabhatiya, Ganitapada 21; and the exact expression 
for the k-th order repeated summation $V_n^{(k)}$ has been given (under the name vara-sankalita), by 
Narayana Pandita (c. 1350) in his Ganitakaumudi, 3.19. This exact expression for $V_n^{(k)}$ is also 
noted in Section 7.5.3 of Yuktibhasa. 


13. Derivation of the Madhava series for π 


The following accurate value of π (correct to 11 decimal places), given by Madhava, 
has been cited by Nilakantha in his Aryabhatiya-bhasya and by Sankara Variyar 
in his Kriyakramakari.$^{77}$ 

[Sanskrit verse in Devanagari; illegible in this copy]

The π value given above is: 

$$\pi \approx \frac{2827433388233}{9 \times 10^{11}} = 3.141592653592\ldots \qquad (111)$$

The 13 digit number appearing in the numerator has been specified using the bhuta-sankhya 
system, whereas the denominator is specified by word numerals.$^{78}$ 
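As a small arithmetic check of (111), the quotient can be evaluated exactly and compared with a modern value of π. This is only an illustrative sketch; the variable and function names are assumptions made here.

```python
from fractions import Fraction
import math

# Numerator and denominator as stated in (111).
MADHAVA_PI = Fraction(2827433388233, 9 * 10 ** 11)


def absolute_error() -> float:
    """Absolute difference between the value in (111) and math.pi."""
    return abs(float(MADHAVA_PI) - math.pi)


if __name__ == "__main__":
    print(f"value in (111): {float(MADHAVA_PI):.13f}")
    print(f"math.pi       : {math.pi:.13f}")
    print(f"difference    : {absolute_error():.2e}")
```

The difference is a few parts in 10^12, consistent with the statement that the value is correct to 11 decimal places.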


13.1. Infinite series for π 


The infinite series for π attributed to Madhava is cited by Sankara Variyar in his 
commentaries Kriyakramakari and Yukti-dipika. Madhava's verse quoted runs 
as follows:$^{79}$ 

[Sanskrit verse in Devanagari; illegible in this copy]

The diameter multiplied by four and divided by unity [is found and saved]. Again 
the products of the diameter and four are divided by the odd numbers like three, 
five, etc., and the results are subtracted and added in order [to the earlier result 
saved]. 


The series given by the verse may be represented as 

$$\text{Paridhi} = 4 \times \text{Vyasa} \times \left(1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \ldots\right). \qquad (112)$$


77 Aryabhatiya-bhasya on Aryabhatiya, cited above (fn. 53), comm. on Ganitapada 10, p. 42; 
Kriyakramakari on Lilavati, cited above (fn. 14), comm. on verse 199, p. 377. 

78 In the bhuta-sankhya system, vibudha = 33, netra = 2, gaja = 8, ahi = 8, hutasana = 3, 
triguna = 3, veda = 4, bha = 27, varana = 8, bahu = 2. In word numerals, nikharva represents $10^{11}$. 
Hence, nava-nikharva = $9 \times 10^{11}$. 

79 op. cit., p. 379. 


The words paridhi and vyasa$^{80}$ in the above equation refer to the circumference 
and diameter respectively. Hence the equation may be rewritten as 

$$\frac{\pi}{4} = \left(1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \ldots\right). \qquad (113)$$


[Figure: inset labelled "Blown up version of this quadrant"]

FIGURE 8. Geometrical construction used in the proof of the 
infinite series for π. 


We shall now present the derivation of the above result as outlined in Yuktibhasa 
of Jyesthadeva and Kriyakramakari of Sankara Variyar. For this purpose, 
let us consider the quadrant $OP_0P_nS$ of the square circumscribing the given 
circle (see Figure 8). Divide the side $P_0P_n$ into n equal parts (n very large). The $P_0P_i$'s 
are the bhujas and the $OP_i$'s are the karnas denoted by $k_i$. The points of intersection 
of these karnas and the circle are marked as $A_i$'s. 


The bhujas $P_0P_i$, the karnas $k_i$ and the east-west line $OP_0$ form right-angled 
triangles whose hypotenuses are given by 

$$k_i^2 = r^2 + \left(\frac{ir}{n}\right)^2, \qquad (114)$$

where r is the radius of the circle. 


80 Nilakantha, in his Aryabhatiya-bhasya, presents the etymological derivation of the 
word vyasa as 'the one which splits the circle into two halves': [Sanskrit text in Devanagari; 
illegible in this copy] (Aryabhatiya-bhasya, cited above (fn. 53), comm. on Ganitapada 11, p. 43). 
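The feet of perpendiculars from the points $A_{i-1}$ and $P_{i-1}$ along the i-th karna 
are denoted by $B_i$ and $C_i$. The triangles $OP_{i-1}C_i$ and $OA_{i-1}B_i$ are similar. 
Hence, 

$$\frac{A_{i-1}B_i}{OA_{i-1}} = \frac{P_{i-1}C_i}{OP_{i-1}}. \qquad (115)$$

Similarly triangles $P_{i-1}C_iP_i$ and $P_0OP_i$ are similar. Hence, 

$$\frac{P_{i-1}C_i}{P_{i-1}P_i} = \frac{OP_0}{OP_i}. \qquad (116)$$

From these two relations we have, 

$$\begin{aligned} A_{i-1}B_i &= \frac{OA_{i-1} \cdot OP_0 \cdot P_{i-1}P_i}{OP_{i-1} \cdot OP_i} \\ &= P_{i-1}P_i \times \frac{OA_{i-1}}{OP_{i-1}} \times \frac{OP_0}{OP_i} \\ &= \left(\frac{r}{n}\right) \times \frac{r}{k_{i-1}} \times \frac{r}{k_i} \\ &= \left(\frac{r}{n}\right)\left(\frac{r^2}{k_{i-1}k_i}\right). \end{aligned} \qquad (117)$$

It is then noted that when n is large, the Rsines $A_{i-1}B_i$ can be taken as the arc-bits 
themselves. 

[Sanskrit phrase in Devanagari; illegible in this copy]

i.e., $A_{i-1}B_i \rightarrow A_{i-1}A_i$. 


Thus, one-eighth of the circumference of the circle can be written as the sum of the 
contributions given by (117). That is 

$$\frac{C}{8} \approx \left(\frac{r}{n}\right)\left[\frac{r^2}{k_0k_1} + \frac{r^2}{k_1k_2} + \frac{r^2}{k_2k_3} + \ldots + \frac{r^2}{k_{n-1}k_n}\right]. \qquad (118)$$

Though this is the expression that actually needs to be evaluated, the text mentions 
that there may not be much difference in approximating it by either of the 
following expressions: 

$$\left[\frac{C}{8}\right]_{left} = \left(\frac{r}{n}\right)\left[\frac{r^2}{k_0^2} + \frac{r^2}{k_1^2} + \frac{r^2}{k_2^2} + \ldots + \frac{r^2}{k_{n-1}^2}\right], \qquad (119)$$

$$\left[\frac{C}{8}\right]_{right} = \left(\frac{r}{n}\right)\left[\frac{r^2}{k_1^2} + \frac{r^2}{k_2^2} + \frac{r^2}{k_3^2} + \ldots + \frac{r^2}{k_n^2}\right]. \qquad (120)$$

It can be easily seen that 

$$\left[\frac{C}{8}\right]_{right} < \frac{C}{8} < \left[\frac{C}{8}\right]_{left}. \qquad (121)$$

In other words, though the actual value of the circumference lies in between the 
values given by (120) and (119), what is being said is that there will not be much 
difference if we divide by the square of either of the karnas rather than by the 
product of two successive ones. Actually, the difference between (120) and (119) 
is given by 

$$\left[\frac{C}{8}\right]_{left} - \left[\frac{C}{8}\right]_{right} = \left(\frac{r}{n}\right)\left[\frac{r^2}{k_0^2} - \frac{r^2}{k_n^2}\right] = \left(\frac{r}{n}\right)\left(1 - \frac{1}{2}\right) = \left(\frac{r}{n}\right)\left(\frac{1}{2}\right), \qquad (122)$$

since $k_0^2 = r^2$ and $k_n^2 = 2r^2$. Evidently this difference approaches zero as n becomes very large, as noted in 
both the texts Yuktibhasa and Kriyakramakari. 

The terms in (120) are evaluated using the sodhya-phala technique (binomial 
series, discussed earlier in Section 11) and each one of them may be re-written in 
the form$^{81}$ 

$$\frac{r}{n}\left(\frac{r^2}{k_i^2}\right) = \frac{r}{n} - \frac{r}{n}\left(\frac{k_i^2 - r^2}{r^2}\right) + \frac{r}{n}\left(\frac{k_i^2 - r^2}{r^2}\right)^2 - \ldots \qquad (123)$$


81 It may be noted that this series is convergent since $k_i^2 = r^2 + \left(\frac{ir}{n}\right)^2$ and $0 < \frac{k_i^2 - r^2}{r^2} < 1$ 
for i < n. 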
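The bracketing relation (121) and the difference (122) are easy to confirm numerically. The following is a minimal sketch, with names chosen here for illustration, that evaluates the sum over products of successive karnas (118) together with the two approximations (119) and (120), and compares them with one-eighth of the circumference, 2πr/8.

```python
import math


def eighth_circumference_sums(n: int, r: float = 1.0):
    """Return the sums (118), (119) and (120) for a circle of radius r."""
    step = r / n                                                  # bhuja-khanda r/n
    karna_sq = [r * r + (i * step) ** 2 for i in range(n + 1)]    # k_i^2 from (114)
    karna = [math.sqrt(v) for v in karna_sq]
    product = step * sum(r * r / (karna[i - 1] * karna[i]) for i in range(1, n + 1))
    left = step * sum(r * r / karna_sq[i] for i in range(0, n))
    right = step * sum(r * r / karna_sq[i] for i in range(1, n + 1))
    return product, left, right


if __name__ == "__main__":
    for n in (10, 100, 1000):
        product, left, right = eighth_circumference_sums(n)
        print(n, right, product, left, left - right, math.pi / 4)
```

For r = 1 the three sums close in on π/4 from both sides, and the gap between the left and right sums is r/(2n), as stated in (122).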
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Using (114) and (123) in (120), we obtain: 


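$$\begin{aligned} \frac{C}{8} &= \sum_{i=1}^{n}\left[\frac{r}{n} - \frac{r}{n}\left(\frac{\left(\frac{ir}{n}\right)^2}{r^2}\right) + \frac{r}{n}\left(\frac{\left(\frac{ir}{n}\right)^2}{r^2}\right)^2 - \ldots\right] \\ &= \frac{r}{n}\left[1 + 1 + \ldots + 1\right] \\ &\quad - \left(\frac{r}{n}\right)\left(\frac{1}{r^2}\right)\left[\left(\frac{r}{n}\right)^2 + \left(\frac{2r}{n}\right)^2 + \ldots + \left(\frac{nr}{n}\right)^2\right] \\ &\quad + \left(\frac{r}{n}\right)\left(\frac{1}{r^4}\right)\left[\left(\frac{r}{n}\right)^4 + \left(\frac{2r}{n}\right)^4 + \ldots + \left(\frac{nr}{n}\right)^4\right] \\ &\quad - \left(\frac{r}{n}\right)\left(\frac{1}{r^6}\right)\left[\left(\frac{r}{n}\right)^6 + \left(\frac{2r}{n}\right)^6 + \ldots + \left(\frac{nr}{n}\right)^6\right] \\ &\quad + \ldots \end{aligned} \qquad (126)$$

Each of the terms in (126) is a sum of results (phala-yoga) which we need to 
estimate when n is very large, and we have a series of them (phala-parampara) 
which are alternately positive and negative. Clearly the first term is just the sum 
of the bhuja-khandas. 

The bhujas themselves are given by the integral multiples of bhuja-khanda, 
namely, $\frac{r}{n}, \frac{2r}{n}, \ldots, \frac{nr}{n}$. In the series expression for the circumference given above, 
we thus have the sankalitas or summations of even powers of the bhujas, such 
as the bhuja-varga-sankalita, $\left(\frac{r}{n}\right)^2 + \left(\frac{2r}{n}\right)^2 + \ldots + \left(\frac{nr}{n}\right)^2$, the bhuja-varga-varga-sankalita, 
$\left(\frac{r}{n}\right)^4 + \left(\frac{2r}{n}\right)^4 + \ldots + \left(\frac{nr}{n}\right)^4$, and so on. 

If we take out the powers of bhuja-khanda $\frac{r}{n}$, the summations involved are 
those of even powers of the natural numbers, namely ekadyekottara-varga-sankalita, 
$1^2 + 2^2 + \ldots + n^2$, ekadyekottara-varga-varga-sankalita, $1^4 + 2^4 + \ldots + n^4$, 
and so on. 

Now, recalling the estimates that were obtained earlier for these sankalitas, 
when n is large, 

$$1^k + 2^k + \ldots + n^k \approx \frac{n^{k+1}}{k+1}, \qquad (127)$$

we arrive at the result$^{82}$ 

$$\frac{C}{8} = r\left(1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \ldots\right), \qquad (128)$$

which is the same as (112). 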


14. Derivation of end-correction terms (Antya-samskāra) 


It is well known that the series given by (112) for $\frac{\pi}{4}$ is an extremely slowly 
converging series. It is so slow that even for obtaining the value of π correct to 2 
decimal places one has to find the sum of hundreds of terms, and for getting it 
correct to 4-5 decimal places we need to consider tens or hundreds of thousands of 
terms. Madhava seems to have found an ingenious way to circumvent this problem. The technique 
employed by Madhava is known as antya-samskara. The nomenclature stems 
from the fact that a correction (samskara) is applied towards the end (anta) of the 
series, when it is terminated after considering only a certain number of terms from 
the beginning. 
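The slowness of the uncorrected series is easy to see numerically. The sketch below (function name assumed here for illustration) sums the series 4(1 - 1/3 + 1/5 - ...) for increasing numbers of terms and prints the remaining error, which shrinks only in inverse proportion to the number of terms.

```python
import math


def madhava_partial_sum(terms: int) -> float:
    """4 * (1 - 1/3 + 1/5 - ...) truncated after `terms` terms."""
    total = 0.0
    sign = 1.0
    for j in range(terms):
        total += sign / (2 * j + 1)
        sign = -sign
    return 4.0 * total


if __name__ == "__main__":
    for terms in (10, 100, 1000, 10_000, 100_000):
        error = abs(math.pi - madhava_partial_sum(terms))
        print(f"{terms:>7} terms: error = {error:.2e}")
```

Each tenfold increase in the number of terms gains roughly one further decimal place, which is why an end-correction is so valuable.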


14.1. The criterion for antya-samskara to yield accurate result 

The discussion on antya-samskara in both Yuktibhasa and Kriyakramakari commences 
with the question: 


How is it that one obtains the value of the circumference more accurately by doing 
antya-samskara, instead of repeatedly dividing by odd numbers?$^{83}$ 


82 In modern terminology, the above derivation amounts to the evaluation of the following integral: 

$$\frac{C}{8} = \lim_{n \to \infty} \sum_{i=1}^{n} \left(\frac{r}{n}\right)\left(\frac{r^2}{r^2 + \left(\frac{ir}{n}\right)^2}\right) = r \int_0^1 \frac{dx}{1 + x^2}.$$

83 [Sanskrit text in Devanagari; illegible in this copy] (Kriyakramakari on Lilavati, cited above (fn. 14), 
comm. on verse 199, p. 386.) 
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The argument adduced in favor of terminating the series at any desired term, still 
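ensuring the accuracy, is as follows. Let the series for $\frac{\pi}{4}$ be written as 

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \ldots + (-1)^{\frac{p-3}{2}} \frac{1}{p-2} + (-1)^{\frac{p-1}{2}} \frac{1}{a_{p-2}}, \qquad (129)$$

where $\frac{1}{a_{p-2}}$ is the correction term applied after the odd denominator p - 2. On the 
other hand, if the correction term $\frac{1}{a_p}$ is applied after the odd denominator p, then 

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \frac{1}{7} + \ldots + (-1)^{\frac{p-3}{2}} \frac{1}{p-2} + (-1)^{\frac{p-1}{2}} \frac{1}{p} + (-1)^{\frac{p+1}{2}} \frac{1}{a_p}. \qquad (130)$$

If the correction terms indeed lead to the exact result, then both the series (129) 
and (130) should yield the same result. That is, 

$$\frac{1}{a_{p-2}} = \frac{1}{p} - \frac{1}{a_p} \quad \text{or} \quad \frac{1}{a_{p-2}} + \frac{1}{a_p} = \frac{1}{p}, \qquad (131)$$

is the criterion that must be satisfied for the end-correction (antya-samskara) to 
lead to the exact result. 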


14.2. Successive approximations to get more accurate correction-terms 


The criterion given by (131) is trivially satisfied when we choose $a_{p-2} = a_p = 2p$. 
However, this value 2p cannot be assigned to both the correction-divisors$^{84}$ 
$a_{p-2}$ and $a_p$ because both the corrections should follow the same rule. That is, 

$$a_{p-2} = 2p \;\Rightarrow\; a_p = 2(p+2),$$
$$\text{or,} \quad a_p = 2p \;\Rightarrow\; a_{p-2} = 2(p-2).$$

We can, however, have both $a_{p-2}$ and $a_p$ close to 2p by taking $a_{p-2} = 2p - 2$ 
and $a_p = 2p + 2$, as there will always persist this much difference between p - 2 
and p when they are doubled. Hence, the first (order) estimate of the correction 
divisor is given as, "double the even number above the last odd-number divisor p", 

$$a_p = 2(p+1). \qquad (132)$$

But, it can be seen right away that, with this value of the correction divisor, the 
condition for accuracy (131), stated above, is not exactly satisfied. Therefore a 
measure of inaccuracy (sthaulya) E(p) is introduced: 

$$E(p) = \left[\frac{1}{a_{p-2}} + \frac{1}{a_p}\right] - \frac{1}{p}. \qquad (133)$$

Now, since the error cannot be eliminated, the objective is to find the correction 
denominators $a_p$ such that the inaccuracy E(p) is minimised. When we set $a_p = 2(p+1)$, 
the inaccuracy will be 

$$E(p) = \left[\frac{1}{(2p-2)} + \frac{1}{(2p+2)}\right] - \frac{1}{p} = \frac{1}{(p^3 - p)}. \qquad (134)$$

This estimate of the inaccuracy, E(p) being positive, shows that the correction has 
been overdone and hence there has to be a reduction in the correction. This means 
that the correction-divisor has to be increased. If we take $a_p = 2p + 3$, thereby 
leading to $a_{p-2} = 2p - 1$, we have 

$$E(p) = \left[\frac{1}{(2p-1)} + \frac{1}{(2p+3)}\right] - \frac{1}{p} = \frac{(-2p + 3)}{(4p^3 + 4p^2 - 3p)}. \qquad (135)$$

Now, the inaccuracy happens to be negative. But, more importantly, it has a term 
proportional to p in the numerator. Hence, for large p, E(p) given by (135) varies 
inversely as $p^2$, while for the divisor given by (132), E(p) as given by (134) varied 
inversely as $p^3$.$^{85}$ 

From (134) and (135) it is obvious that, if we want to reduce the inaccuracy and 
thereby obtain a better correction, then a number less than 1 has to be added to the 
correction-divisor (132) given above. If we try adding rupa (unity) divided by the 
correction divisor itself, i.e., if we set $a_p = 2p + 2 + \frac{1}{(2p+2)}$, the resulting change 
in E(p) is, for large p, essentially $-\frac{1}{4p^3}$, which cancels only a quarter of the leading 
term $\frac{1}{p^3}$ of (134). Hence, to get rid of this leading contribution, we need an extra factor of 4, 
which will be achieved if we take the correction divisor to be 

$$a_p = (2p+2) + \frac{4}{(2p+2)} = \frac{(2p+2)^2 + 4}{(2p+2)}. \qquad (136)$$


84 By the term correction-divisor (samskara-haraka) is meant the divisor of the correction term. 

85 It may be noted that among all possible correction divisors of the type $a_p = 2p + m$, where m 
is an integer, the choice of m = 2 is optimal, as in all other cases there will arise a term proportional to 
p in the numerator of the inaccuracy E(p). 
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Then, correspondingly, we have 


$$a_{p-2} = (2p-2) + \frac{4}{(2p-2)} = \frac{(2p-2)^2 + 4}{(2p-2)}. \qquad (137)$$

We can then calculate the inaccuracy to be 

$$\begin{aligned} E(p) &= \left[\frac{1}{(2p-2) + \frac{4}{(2p-2)}} + \frac{1}{(2p+2) + \frac{4}{(2p+2)}}\right] - \frac{1}{p} \\ &= \left[\frac{(4p^3)}{(4p^4 + 16)}\right] - \left[\frac{(16p^4 + 64)}{4p(4p^4 + 16)}\right] \\ &= \frac{-4}{(p^5 + 4p)}. \end{aligned} \qquad (138)$$

Clearly, the sthaulya with this (second order) correction divisor has improved 
considerably, in that it is now proportional to the inverse fifth power of the odd 
number.$^{86}$ 

At this stage, we may display the result obtained for the circumference with the 
correction term as follows. If only the first order correction (132) is employed, we 
have 

$$C = 4d\left[1 - \frac{1}{3} + \ldots + (-1)^{\frac{(p-1)}{2}} \frac{1}{p} + (-1)^{\frac{(p+1)}{2}} \frac{1}{(2p+2)}\right]. \qquad (139)$$

If the second order correction (136) is taken into account, we have 

$$\begin{aligned} C &= 4d\left[1 - \frac{1}{3} + \ldots + (-1)^{\frac{(p-1)}{2}} \frac{1}{p} + (-1)^{\frac{(p+1)}{2}} \frac{1}{(2p+2) + \frac{4}{(2p+2)}}\right] \\ &= 4d\left[1 - \frac{1}{3} + \ldots + (-1)^{\frac{(p-1)}{2}} \frac{1}{p} + (-1)^{\frac{(p+1)}{2}} \frac{\frac{(p+1)}{2}}{(p+1)^2 + 1}\right]. \end{aligned} \qquad (140)$$


86 It may be noted that if we take any other correction-divisor $a_p = 2p + 2 + \frac{m}{(2p+2)}$, where m is 
an integer, we will end up having a contribution proportional to $p^2$ in the numerator of the inaccuracy 
E(p), unless m = 4. Thus the above form (136) is the optimal second order choice for the correction-divisor. 
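The closed forms (134), (135) and (138) for the sthaulya can be confirmed with exact rational arithmetic. The sketch below is illustrative; the function names are assumptions made here. It evaluates E(p) = 1/a_{p-2} + 1/a_p - 1/p for each choice of correction-divisor and compares the result with the stated expressions.

```python
from fractions import Fraction


def divisor_first(p: int) -> Fraction:
    """First-order correction-divisor a_p = 2p + 2, as in (132)."""
    return Fraction(2 * p + 2)


def divisor_2p_plus_3(p: int) -> Fraction:
    """Trial divisor a_p = 2p + 3 used in (135)."""
    return Fraction(2 * p + 3)


def divisor_second(p: int) -> Fraction:
    """Second-order correction-divisor (2p + 2) + 4/(2p + 2), as in (136)."""
    q = Fraction(2 * p + 2)
    return q + 4 / q


def sthaulya(divisor, p: int) -> Fraction:
    """Inaccuracy E(p) = 1/a_{p-2} + 1/a_p - 1/p, as in (133)."""
    return 1 / divisor(p - 2) + 1 / divisor(p) - Fraction(1, p)


if __name__ == "__main__":
    for p in (3, 5, 11, 99):
        print(p,
              sthaulya(divisor_first, p),
              sthaulya(divisor_2p_plus_3, p),
              sthaulya(divisor_second, p))
```

Because the arithmetic is exact, the comparison is an identity check for each odd p tried, not a floating-point approximation.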
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The verse due to Madhava that we cited earlier as defining the infinite series for 
$\frac{\pi}{4}$ is, in fact, the first of a group of four verses that present the series along with 
the above end-correction.$^{87}$ 

[Sanskrit verses in Devanagari; illegible in this copy]

The diameter multiplied by four and divided by unity. Again the products of 
the diameter and four are divided by the odd numbers like three, five, etc., and the 
results are subtracted and added in order. 


Take half of the succeeding even number as the multiplier at whichever [odd] 
number the division process is stopped, because of boredom. The square of that 
[even number] added to unity is the divisor. Their ratio has to be multiplied by 
the product of the diameter and four as earlier. 


The result obtained has to be added if the earlier term [in the series] has been 
subtracted and subtracted if the earlier term has been added. The resulting circumference 
is very accurate; in fact more accurate than the one which may be 
obtained by continuing the division process [with large number of terms in the 
series]. 


Continuing this process further, Yuktibhasa presents the next order correction-term 
which is said to be even more accurate:$^{88}$ 

[Sanskrit verse in Devanagari; illegible in this copy]

At the end, [i.e., after terminating the series at some point, apply the correction 
term with] the multiplier being square of half of the [next] even number plus 1, 
and the divisor being four times the same multiplier with 1 added and multiplied 
by half the even number. 


87 Kriyakramakari on Lilavati, cited above (fn. 14), comm. on verse 199, p. 379. 
88 Ganita-yukti-bhasa, cited above, p. 82; also cited in Yukti-dipika on Tantrasangraha, 
cited above (fn. 49), comm. on verse 2.1, p. 103. 
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In other words,*? 


$$\frac{1}{a_p} = \frac{\left(\frac{p+1}{2}\right)^2 + 1}{\left[\left(\left(\frac{p+1}{2}\right)^2 + 1\right) 4 + 1\right]\left(\frac{p+1}{2}\right)} = \frac{1}{(2p+2) + \dfrac{4}{(2p+2) + \dfrac{16}{(2p+2)}}}. \qquad (141)$$

Hence, a much better approximation for $\frac{\pi}{4}$ is:$^{90}$ 

$$\frac{\pi}{4} \approx 1 - \frac{1}{3} + \frac{1}{5} - \ldots + (-1)^{\frac{(p-1)}{2}} \frac{1}{p} + (-1)^{\frac{(p+1)}{2}} \frac{\left(\frac{p+1}{2}\right)^2 + 1}{\left((p+1)^2 + 5\right)\left(\frac{p+1}{2}\right)}. \qquad (142)$$


89 The inaccuracy or sthaulya associated with this correction can be calculated to be 

$$E(p) = \frac{2304}{(64p^7 + 448p^5 + 1792p^3 - 2304p)}.$$

The inaccuracy now is proportional to the inverse seventh power of the odd-number. Again it can be 
shown that the number 16 in (141) is optimally chosen, in that any other choice would introduce a term 
proportional to $p^2$ in the numerator of E(p), given above. 

In fact, it has been noted by C. T. Rajagopal and M. S. Rangachari that D. T. Whiteside has shown 
(personal communication of D. T. Whiteside cited in C. T. Rajagopal and M. S. Rangachari, 'On an 
untapped source of medieval Kerala mathematics', Arch. for Hist. Sc. 35(2), 89-102, 1978), that the 
end correction-term can be exactly represented by the following continued fraction 

$$\frac{1}{a_p} = \cfrac{1}{(2p+2) + \cfrac{2^2}{(2p+2) + \cfrac{4^2}{(2p+2) + \cfrac{6^2}{(2p+2) + \ldots}}}}.$$

90 It may be noted that this correction term leads to a value of π which is accurate up to 11 decimal 
places, when we merely evaluate terms up to n = 50 in the series (142). Incidentally the value of π, 
given in the rule vibudhanetra..., attributed to Madhava that was cited in the beginning of Section 13, 
is also accurate up to 11 decimal places. 
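The effect of the three successive end-corrections can be compared numerically. The sketch below is illustrative only: the function names are assumptions made here, and "terms up to n = 50" is read as summing 50 terms of the series, so that the last odd divisor is p = 99. Exact fractions are used for the sum so that the only rounding occurs in the final conversion to a float.

```python
from fractions import Fraction
import math


def correction(order: int, p: int) -> Fraction:
    """Correction term 1/a_p applied after the last odd divisor p."""
    q = Fraction(2 * p + 2)
    if order == 1:
        a = q                          # (132)
    elif order == 2:
        a = q + 4 / q                  # (136)
    elif order == 3:
        a = q + 4 / (q + 16 / q)       # (141)
    else:
        raise ValueError("order must be 1, 2 or 3")
    return 1 / a


def pi_with_correction(terms: int, order: int) -> Fraction:
    """4 * (truncated series + end-correction), following (139), (140), (142)."""
    total = Fraction(0)
    sign = 1
    for j in range(terms):
        total += Fraction(sign, 2 * j + 1)
        sign = -sign
    p = 2 * terms - 1                  # last odd divisor actually used
    total += sign * correction(order, p)   # sign is now that of the next term
    return 4 * total


if __name__ == "__main__":
    for order in (1, 2, 3):
        error = abs(float(pi_with_correction(50, order)) - math.pi)
        print(f"order {order}: error = {error:.2e}")
```

With 50 terms, each higher-order correction reduces the error by several orders of magnitude, whereas the uncorrected sum of 50 terms is still wrong in the second decimal place.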
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15. Transforming the Madhava series for better convergence 


After the estimation of end-correction terms, Yuktibhasa goes on to outline a 
method of transforming the Madhava series (by making use of the above end-correction 
terms) to obtain new series that have much better convergence properties. 
We now reproduce the following from the English translation of Yuktibhasa:$^{91}$ 


Therefore, the circumference (of a circle) can be derived in taking into consideration 
what has been stated above. A method for that is stated in the verse 

[Sanskrit verse (I) in Devanagari; illegible in this copy]

The fifth powers of the odd numbers (1, 3, 5 etc.) are increased by 
4 times themselves. The diameter is multiplied by 16 and it is successively 
divided by the (series of) numbers obtained (as above). The 
odd (first, third etc.) quotients obtained are added and are subtracted 
from the sum of the even (the second, fourth etc.) quotients. The 
result is the circumference corresponding to the given diameter. 

Herein above is stated a method for deriving the circumference. If the correction 
term is applied to an approximate circumference and the amount of inaccuracy 
(sthaulya) is found, and if it is additive, then the result is higher. Then 
it will become more accurate when the correction term obtained from the next 
higher odd number is subtracted. Since it happens that (an approximate circumference) 
becomes more and more accurate by making corrections in succeeding 
terms, if the corrections are applied right from the beginning itself, then the circumference 
will come out accurate. This is the rationale for this (above-stated 
result). 

When it is presumed that the correction-divisor is just double the odd number, 
the following is a method to obtain the (accurate) circumference by a correction 
for the corresponding inaccuracy (sthaulyamsa-parihara), which is given by 
the verse: 

[Sanskrit verse (II) in Devanagari; illegible in this copy]

The diameter is multiplied by 4 and is divided, successively, by the 
cubes of the odd numbers beginning from 3, which are diminished by 
these numbers themselves. The diameter is now multiplied by three, 
and the quotients obtained above, are added to or subtracted from, 
alternatively. The circumference is to be obtained thus. 

If, however, it is taken that half the result (of dividing) by the last even number 
is taken as the correction, there is a method to derive the circumference by that 
way also, as given by the verse 

[Sanskrit verse (III) in Devanagari; illegible in this copy]

The squares of even numbers commencing from 2, diminished by 
one, are the divisors for four times the diameter. (Make the several 
divisions). The quotients got by (the division) are alternately added 
to or subtracted from twice the diameter. In the end, divide four times 
the diameter by twice the result of squaring the odd number following 
the last even number to which is added 2. 


91 Ganita-yukti-bhasa, cited above, Section 6.9, pp. 80-82, 205-07, 402-04. 


The method of sthaulya-parihara, outlined above, essentially involves incorporating 
the correction terms into the series from the beginning itself. Let us recall 
that inaccuracy or sthaulya at each stage is given by 

$$E(p) = \frac{1}{a_{p-2}} + \frac{1}{a_p} - \left(\frac{1}{p}\right). \qquad (143)$$

The series for the circumference (112) can be expressed in terms of these sthaulyas 
as follows: 

$$\begin{aligned} C &= 4d\left[\left(1 - \frac{1}{a_1}\right) + \left(\frac{1}{a_1} + \frac{1}{a_3} - \frac{1}{3}\right) - \left(\frac{1}{a_3} + \frac{1}{a_5} - \frac{1}{5}\right) + \ldots\right] \\ &= 4d\left[\left(1 - \frac{1}{a_1}\right) + E(3) - E(5) + E(7) - \ldots\right]. \end{aligned} \qquad (144)$$

Now, by choosing different correction-divisors $a_p$ in (144), we get several 
transformed series which have better convergence properties. If we consider the 
correction-divisor (136), then using the expression (138) for the sthaulyas, we get 

$$\begin{aligned} C &= 4d\left[\left(1 - \frac{1}{5}\right) - \frac{4}{(3^5 + 4 \cdot 3)} + \frac{4}{(5^5 + 4 \cdot 5)} - \frac{4}{(7^5 + 4 \cdot 7)} + \ldots\right] \\ &= 16d\left[\frac{1}{(1^5 + 4 \cdot 1)} - \frac{1}{(3^5 + 4 \cdot 3)} + \frac{1}{(5^5 + 4 \cdot 5)} - \ldots\right]. \end{aligned} \qquad (145)$$

The above series is given in the verse samapancahatayoh ... (I). Note that each 
term in the above series involves the fifth power of the odd number in the denominator, 
unlike the original series which only involved the first power of the odd 
number. Clearly, this transformed series gives more accurate results with fewer 
terms. 

If we had used only the lowest order correction (132) and the associated sthaulya 
(134), instead of the correction employed above, then the transformed series is the 
one given in the verse vyasad varidhinihatat ... (II): 

$$C = 4d\left[\frac{3}{4} + \frac{1}{(3^3 - 3)} - \frac{1}{(5^3 - 5)} + \frac{1}{(7^3 - 7)} - \ldots\right]. \qquad (146)$$

Note that the denominators in the above transformed series are proportional to the 
third power of the odd number. 
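The improvement in convergence can be seen by summing a fixed number of terms of (145) and (146) with d = 1, so that C = π. The sketch below is illustrative; the function names are assumptions made here, and `terms` counts the leading term as the first term in both cases.

```python
import math


def series_fifth_power(terms: int) -> float:
    """Partial sum of (145) with d = 1: 16 * sum of (+/-) 1/(p^5 + 4p), p = 1, 3, 5, ..."""
    return 16.0 * sum((-1) ** j / ((2 * j + 1) ** 5 + 4 * (2 * j + 1))
                      for j in range(terms))


def series_cubic(terms: int) -> float:
    """Partial sum of (146) with d = 1: 3 + 4 * [1/(3^3 - 3) - 1/(5^3 - 5) + ...]."""
    tail = sum((-1) ** (j + 1) / ((2 * j + 1) ** 3 - (2 * j + 1))
               for j in range(1, terms))
    return 3.0 + 4.0 * tail


if __name__ == "__main__":
    for terms in (5, 10, 20):
        print(terms,
              abs(series_cubic(terms) - math.pi),
              abs(series_fifth_power(terms) - math.pi))
```

With 20 terms the cubic series is already good to about four decimal places and the fifth-power series to about seven, whereas 20 terms of the original series are off in the second decimal place.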




Even if we take non-optimal correction-divisors, we often end up obtaining 
interesting series. For instance, if we take a non-optimal correction-divisor, say of 
the form $a_p = 2p$, then the sthaulya is given by 

$$\begin{aligned} E(p) &= \frac{1}{(2p-4)} + \frac{1}{2p} - \frac{1}{p} \\ &= \frac{1}{(p^2 - 2p)} \\ &= \frac{1}{(p-1)^2 - 1}. \end{aligned} \qquad (147)$$

Then, the transformed series will be the one given in the verse dvyadiyujam 
va krtayo ... (III):$^{92}$ 

$$C = 4d\left[\frac{1}{2} + \frac{1}{(2^2 - 1)} - \frac{1}{(4^2 - 1)} + \frac{1}{(6^2 - 1)} - \ldots\right]. \qquad (148)$$


16. Derivation of the Madhava series for Rsine and Rversine 
16.1. First and second order differences of Rsines 


We shall now outline the derivation of Madhava series for Rsine (bhuja-jya) and 
Rversine (sara), as given in Yuktibhasa.$^{93}$ Yuktibhasa begins with a discussion 
of the first and second order Rsine-differences and derives an exact form of the 
result of Aryabhata that the second-order Rsine-differences are proportional to 
the Rsines themselves. We had briefly indicated this proof in Section 5.3. 

Here we are interested in obtaining the Madhava series for the jya and sara of 
an arc of length s indicated by EC in Figure 9. This arc is divided into n equal 
arc bits, where n is large. If the arc length s = Rθ, then the j-th pinda-jya, $B_j$, is 
given by$^{94}$ 

$$B_j = \text{jya}\left(\frac{js}{n}\right) = R \sin\left(\frac{j\theta}{n}\right). \qquad (149)$$


92 The verse III in fact presents the series (148) along with an end correction-term whose divisor is 
$2(p+1)^2 + 2$, where p is the last even number used. 
93 Yuktibhasa, cited earlier, Vol. I, Section 16.5, pp. 94-103, 221-233, 417-427. 
94 Figure 9 is essentially the same as Figure 3 considered in Section 5 except that the pindajyas $B_j$ 
are Rsines associated with multiples of the arc-bit $\frac{s}{n}$ into which the arc EC = s is divided. In Figure 3, 
the $B_j$'s are the tabular Rsines associated with multiples of 225'. 


The corresponding koti-jya $K_j$ and the sara $S_j$ are given by 

$$K_j = \text{koti}\left(\frac{js}{n}\right) = R \cos\left(\frac{j\theta}{n}\right), \qquad (150)$$

$$S_j = \text{sara}\left(\frac{js}{n}\right) = R\left[1 - \cos\left(\frac{j\theta}{n}\right)\right]. \qquad (151)$$

Now, $C_jC_{j+1}$ represents the (j + 1)-th arc bit. Then, for the arc $EC_j = \frac{js}{n}$, 
its pinda-jya is $B_j = C_jP_j$, and the corresponding koti-jya and sara are $K_j = C_jT_j$, 
$S_j = EP_j$. Similarly we have 

$$B_{j+1} = C_{j+1}P_{j+1}, \quad K_{j+1} = C_{j+1}T_{j+1} \quad \text{and} \quad S_{j+1} = EP_{j+1}. \qquad (152)$$

FIGURE 9. Computation of Jya and Sara by Sankalitas. 


Let $M_{j+1}$ be the mid-point of the arc-bit $C_jC_{j+1}$ and similarly $M_j$ the mid-point 
of the previous (j-th) arc-bit. We shall denote the pinda-jya of the arc 
$EM_{j+1}$ as $B_{j+\frac{1}{2}}$ and clearly 

$$B_{j+\frac{1}{2}} = M_{j+1}Q_{j+1}.$$

The corresponding koti-jya and sara are 

$$K_{j+\frac{1}{2}} = M_{j+1}U_{j+1} \quad \text{and} \quad S_{j+\frac{1}{2}} = EQ_{j+1}.$$
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Similarly, 


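$$B_{j-\frac{1}{2}} = M_jQ_j, \quad K_{j-\frac{1}{2}} = M_jU_j \quad \text{and} \quad S_{j-\frac{1}{2}} = EQ_j. \qquad (153)$$

Let a be the chord corresponding to the equal arc-bits $\frac{s}{n}$ as indicated in Figure 9. 
That is, $C_jC_{j+1} = M_jM_{j+1} = a$. Let F be the intersection of $C_jT_j$ 
and $C_{j+1}P_{j+1}$, and G of $M_jU_j$ and $M_{j+1}Q_{j+1}$. The triangles $C_{j+1}FC_j$ and 
$OQ_{j+1}M_{j+1}$ are similar, as their sides are mutually perpendicular. Thus we have 

$$\frac{C_{j+1}C_j}{OM_{j+1}} = \frac{C_{j+1}F}{OQ_{j+1}} = \frac{FC_j}{Q_{j+1}M_{j+1}}. \qquad (154)$$

Hence we obtain 

$$B_{j+1} - B_j = \left(\frac{a}{R}\right) K_{j+\frac{1}{2}}, \qquad (155)$$

$$K_j - K_{j+1} = S_{j+1} - S_j = \left(\frac{a}{R}\right) B_{j+\frac{1}{2}}. \qquad (156)$$

Similarly, the triangles $M_{j+1}GM_j$ and $OP_jC_j$ are similar and we get 

$$\frac{M_{j+1}M_j}{OC_j} = \frac{M_{j+1}G}{OP_j} = \frac{GM_j}{P_jC_j}. \qquad (157)$$

Thus we obtain 

$$B_{j+\frac{1}{2}} - B_{j-\frac{1}{2}} = \left(\frac{a}{R}\right) K_j, \qquad (158)$$

$$K_{j-\frac{1}{2}} - K_{j+\frac{1}{2}} = S_{j+\frac{1}{2}} - S_{j-\frac{1}{2}} = \left(\frac{a}{R}\right) B_j. \qquad (159)$$

We define the Rsine-differences (khanda-jya) $\Delta_j$ by 

$$\Delta_j = B_j - B_{j-1}, \qquad (160)$$

with the convention that $\Delta_1 = B_1$. From (155), we have 

$$\Delta_j = \left(\frac{a}{R}\right) K_{j-\frac{1}{2}}. \qquad (161)$$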



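From (159) and (161), we also get the second order Rsine-differences (the differences 
of the Rsine-differences called khanda-jyantara): 

$$\begin{aligned} \Delta_j - \Delta_{j+1} &= (B_j - B_{j-1}) - (B_{j+1} - B_j) \\ &= \left(\frac{a}{R}\right)\left(K_{j-\frac{1}{2}} - K_{j+\frac{1}{2}}\right) \\ &= \left(\frac{a}{R}\right)\left(S_{j+\frac{1}{2}} - S_{j-\frac{1}{2}}\right) \\ &= \left(\frac{a}{R}\right)^2 B_j. \end{aligned} \qquad (162)$$

Now, if the sum of the second-order Rsine-differences is subtracted from the first 
Rsine-difference, then we get any desired Rsine-difference. That is 

$$\Delta_1 - \left[(\Delta_1 - \Delta_2) + (\Delta_2 - \Delta_3) + \ldots + (\Delta_{j-1} - \Delta_j)\right] = \Delta_j. \qquad (163)$$

From (162) and (163) we conclude that 

$$\Delta_1 - \left(\frac{a}{R}\right)^2 (B_1 + B_2 + \ldots + B_{j-1}) = \Delta_j. \qquad (164)$$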
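The relations (162) and (164) are exact trigonometric identities once the chord of one arc-bit is written as a = 2R sin(θ/2n), and this can be checked numerically. The sketch below is illustrative: the function name is an assumption made here, and the values R = 3438 with 24 divisions of a quadrant are chosen purely as an example.

```python
import math


def check_rsine_differences(R: float, theta: float, n: int):
    """Return the largest residuals of (162) and (164) for arc R*theta cut into n bits."""
    phi = theta / n                                    # angle subtended by one arc-bit
    a = 2.0 * R * math.sin(phi / 2.0)                  # full chord of one arc-bit
    B = [R * math.sin(j * phi) for j in range(n + 1)]  # pinda-jyas, with B_0 = 0
    # delta[j] = B_j - B_{j-1}; index 0 is an unused placeholder.
    delta = [0.0] + [B[j] - B[j - 1] for j in range(1, n + 1)]
    ratio = (a / R) ** 2
    err_162 = max(abs((delta[j] - delta[j + 1]) - ratio * B[j])
                  for j in range(1, n))
    err_164 = max(abs(delta[1] - ratio * sum(B[1:j]) - delta[j])
                  for j in range(1, n + 1))
    return err_162, err_164


if __name__ == "__main__":
    print(check_rsine_differences(3438.0, math.pi / 2, 24))
```

Both residuals are at the level of floating-point rounding for any n, which illustrates that no large-n approximation has been made up to this point.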


16.2. Rsines and Rversines from Jya-sankalita 


We can sum up the Rversine-differences (159), to obtain the sara, Rversine, at the 
midpoint of the last arc-bit as follows: 

$$\begin{aligned} S_{n-\frac{1}{2}} - S_{\frac{1}{2}} &= \left(S_{n-\frac{1}{2}} - S_{n-\frac{3}{2}}\right) + \ldots + \left(S_{\frac{3}{2}} - S_{\frac{1}{2}}\right) \\ &= \left(\frac{a}{R}\right)(B_{n-1} + B_{n-2} + \ldots + B_1). \end{aligned} \qquad (165)$$

Using (162), the right hand side of (165) can also be expressed as a summation of 
the second order differences. From (164) and (165) it follows that the Rversine at 
the midpoint of the last arc-bit is also given by 

$$\left(\frac{a}{R}\right)\left(S_{n-\frac{1}{2}} - S_{\frac{1}{2}}\right) = (\Delta_1 - \Delta_n). \qquad (166)$$

Now, since the first Rsine-difference $\Delta_1 = B_1$, any desired Rsine can be obtained 
by adding the Rsine-differences; these Rsine-differences have been obtained in 
(164). Now, by making use of (164), the last pinda-jya can be expressed as follows: 

$$\begin{aligned} B_n &= \Delta_n + \Delta_{n-1} + \ldots + \Delta_1 \\ &= n\Delta_1 - \left(\frac{a}{R}\right)^2 \left[(B_1 + B_2 + \ldots + B_{n-1}) + (B_1 + B_2 + \ldots + B_{n-2}) + \ldots + B_1\right] \\ &= nB_1 - \left(\frac{a}{R}\right)^2 \left[B_{n-1} + 2B_{n-2} + \ldots + (n-1)B_1\right]. \end{aligned} \qquad (167)$$

The results (158) - (167), obtained so far, involve no approximations. It is now 
shown how better and better approximations to the Rsine and Rversine can be 
obtained by taking n to be very large or, equivalently, the arc-bit $\frac{s}{n}$ to be very 
small. Then, we can approximate the full-chord and the Rsine of the arc-bit by the 
arc-bit $\frac{s}{n}$ itself. Also, as a first approximation, we can approximate the pinda-jyas 
$B_j$ in the equations (164), (165) or (167) by the corresponding arcs themselves. 
That is 

$$B_j \approx \frac{js}{n}. \qquad (168)$$

The result for the Rsine obtained this way is again used to obtain a better approximation 
for the pinda-jyas $B_j$ which is again substituted back into the equations 
(165) and (167) and thus by a process of iteration successive better approximations 
are obtained for the Rsine and Rversine. Now, once we take $B_j \approx \frac{js}{n}$, we will be 
led to estimate the sums and repeated sums of natural numbers (ekadyekottara-sankalita), 
when the number of terms is very large. 


16.3. Derivation of Madhava series by iterative corrections to jya and sara 


As we noted earlier, the relations given by (165) and (167) are exact. But now we
shall show how better and better approximations to the Rsine and Rversine of any
desired arc can be obtained by taking n to be very large or, equivalently, taking the
arc-bit $s/n$ to be very small. Then both the full-chord $\alpha$ and the first Rsine $B_1$ (the
Rsine of the arc-bit) can be approximated by the arc-bit $s/n$ itself, the Rversine
$S_{n-\frac{1}{2}}$ can be taken as $S_n$, and the Rversine $S_{\frac{1}{2}}$ may be treated as negligible. Thus
the above relations (165), (167) become⁹⁵

\[
S = S_n \approx \left(\frac{s}{nR}\right)\left(B_{n-1} + B_{n-2} + \cdots + B_1\right), \tag{169}
\]
\[
B = B_n \approx s - \left(\frac{s}{nR}\right)^2 \left[(B_1 + B_2 + \cdots + B_{n-1}) + (B_1 + B_2 + \cdots + B_{n-2}) + \cdots + B_1\right], \tag{170}
\]


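where B and S are the Rsine and Rversine of the desired arc of length s, and the
results will be more accurate the larger the value of n.

Now, as a first approximation, we take each pinda-jya $B_j$ in (169) and (170) to
be equal to the corresponding arc itself, that is

\[
B_j \approx \frac{js}{n}. \tag{171}
\]

Then we obtain for the Rversine

\[
\begin{aligned}
S &\approx \left(\frac{s}{nR}\right)\left[(n-1)\left(\frac{s}{n}\right) + (n-2)\left(\frac{s}{n}\right) + \cdots\right] \\
  &= \left(\frac{1}{R}\right)\left(\frac{s}{n}\right)^2 \left[(n-1) + (n-2) + \cdots\right].
\end{aligned} \tag{172}
\]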


For large n, we can use the estimate (89) for the sum of integers. Hence (172)
reduces to

\[
S \approx \left(\frac{1}{R}\right)\frac{s^2}{2}. \tag{173}
\]

Equation (173) is the first sara-samskara, correction to the Rversine. We now
substitute our first approximation (171) to the pinda-jyas $B_j$ in (170), which gives
the Rsine of the desired arc as a second order repeated sum of the pinda-jyas $B_j$.
We then obtain

\[
B \approx s - \left(\frac{1}{R}\right)^2\left(\frac{s}{n}\right)^3 \left[(1 + 2 + \cdots + (n-1)) + (1 + 2 + \cdots + (n-2)) + \cdots\right]. \tag{174}
\]


95 As has been pointed out by one of the reviewers, in the following derivation, instead of using
the relation (170), which involves repeated summation of pinda-jyas, one could use the much simpler
relation

\[
B = B_n \approx s - \left(\frac{s}{nR}\right)\left(S_{n-1} + S_{n-2} + \cdots + S_1\right),
\]

which essentially follows from (165) and (170). Then we can iterate between the above equation
and (169), which involves considering only sums of powers of integers. Yuktibhāsā, however,
employs successive iteration between (169) and (170), which involves consideration of repeated sums of
integers.




The second term in (174) is a dvitiya-sankalita, the second order repeated sum,
and using the estimate (108), we obtain

\[
B \approx s - \left(\frac{1}{R}\right)^2 \frac{s^3}{1\cdot 2\cdot 3}. \tag{175}
\]

Thus we see that the first correction obtained in (175) to the Rsine-arc-difference
(jya-capantara-samskara) is equal to the earlier correction to the Rversine
(sara-samskara) given in (173) multiplied by the arc and divided by the radius and 3.
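To see how the finite sums in (172) and (174) approach the limiting values (173) and (175) as n grows, one can evaluate them directly. The following is a minimal sketch, assuming only the first approximation $B_j \approx js/n$; the function names are hypothetical and the sums are computed by brute force rather than by the large-n estimates used in the text.

```python
def first_corrections(s, R, n):
    """Evaluate the finite sums (172) and (174) with B_j replaced by j*s/n."""
    bit = s / n
    # (172): S ~ (1/R) (s/n)^2 [(n-1) + (n-2) + ... + 1]
    sara = (1.0 / R) * bit ** 2 * sum(range(1, n))
    # (174): B ~ s - (1/R)^2 (s/n)^3 [(1+...+(n-1)) + (1+...+(n-2)) + ... + 1]
    repeated = sum(j * (j + 1) // 2 for j in range(1, n))
    jya = s - (1.0 / R) ** 2 * bit ** 3 * repeated
    return sara, jya

def limiting_values(s, R):
    """The large-n values (173) and (175)."""
    return s ** 2 / (2.0 * R), s - s ** 3 / (6.0 * R ** 2)

if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        print(n, first_corrections(1.0, 1.0, n), limiting_values(1.0, 1.0))
```

The gap between the finite sums and the limits shrinks as n increases, which is the sense in which the results become "more accurate the larger the value of n".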

It is noted that the results (173) and (175) are only approximate (prayika),
since, instead of the sankalita of the pinda-jyas in (169) and (170), we have only
carried out sankalita of the arc-bits. Now that (175) gives a correction to the
difference between the Rsine and the arc (jya-capantara-samskara), we can use
that to correct the values of the pinda-jyas and thus obtain the next corrections to
the Rversine and Rsine.

Following (175), the pinda-jyas may now be taken as


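\[
B_j \approx \frac{js}{n} - \left(\frac{1}{R}\right)^2 \frac{\left(\frac{js}{n}\right)^3}{1\cdot 2\cdot 3}. \tag{176}
\]

Substituting (176) in (169), we obtain

\[
\begin{aligned}
S \approx{} & \left(\frac{1}{R}\right)\left(\frac{s}{n}\right)^2 \left[(n-1) + (n-2) + \cdots\right] \\
  & - \left(\frac{1}{R}\right)^3\left(\frac{s}{n}\right)^4 \left(\frac{1}{1\cdot 2\cdot 3}\right)\left[(n-1)^3 + (n-2)^3 + \cdots\right].
\end{aligned} \tag{177}
\]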


The first term in (177) was already evaluated while deriving (173). The second
term in (177) can either be estimated as a summation of cubes (ghana-sankalita),
or as a trtiya-sankalita, third order (repeated) summation, because each individual
term there has been obtained by doing a second-order (repeated) summation.
Hence, recollecting our earlier estimate (110) for these sankalitas, we get

\[
S \approx \left(\frac{1}{R}\right)\frac{s^2}{1\cdot 2} - \left(\frac{1}{R}\right)^3 \frac{s^4}{1\cdot 2\cdot 3\cdot 4}. \tag{178}
\]

Equation (178) gives a correction (sara-samskara) to the earlier value (173) of
the Rversine, which is nothing but the earlier correction to the Rsine-arc difference
(jya-capantara-samskara) given in (175) multiplied by the arc and divided by the
radius and 4.
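Again, if we use the corrected pinda-jyas (176) in the expression (170) for the
Rsine, we obtain

\[
\begin{aligned}
B \approx{} & s - \left(\frac{1}{R}\right)^2\left(\frac{s}{n}\right)^3 \left[(1 + 2 + \cdots + (n-1)) + (1 + 2 + \cdots + (n-2)) + \cdots\right] \\
  & + \left(\frac{1}{R}\right)^4\left(\frac{s}{n}\right)^5 \left(\frac{1}{1\cdot 2\cdot 3}\right)\left[(1^3 + 2^3 + \cdots + (n-1)^3) + (1^3 + 2^3 + \cdots + (n-2)^3) + \cdots\right] \\
\approx{} & s - \left(\frac{1}{R}\right)^2 \frac{s^3}{1\cdot 2\cdot 3} + \left(\frac{1}{R}\right)^4 \frac{s^5}{1\cdot 2\cdot 3\cdot 4\cdot 5}.
\end{aligned} \tag{179}
\]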


The above process can be repeated to obtain successive higher order corrections
for the Rversine and Rsine: by first finding a correction (jya-capantara-samskara)
for the difference between the Rsine and the arc, using this correction
to correct the pinda-jyas $B_j$, and using them in equations (169) and (170), we get the
next correction (sara-samskara) for the Rversines, and the next correction
(jya-capantara-samskara) for the Rsine-arc-difference itself, which is then employed
to get further corrections iteratively. In this way we are led to the Madhava series
for jya and sara given by


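\[
\begin{aligned}
B = R\sin\left(\frac{s}{R}\right) &= s - \left(\frac{1}{R}\right)^2 \frac{s^3}{1\cdot 2\cdot 3} + \left(\frac{1}{R}\right)^4 \frac{s^5}{1\cdot 2\cdot 3\cdot 4\cdot 5} - \left(\frac{1}{R}\right)^6 \frac{s^7}{1\cdot 2\cdot 3\cdot 4\cdot 5\cdot 6\cdot 7} + \cdots, \\
S = R\,\mathrm{vers}\left(\frac{s}{R}\right) &= \left(\frac{1}{R}\right)\frac{s^2}{2} - \left(\frac{1}{R}\right)^3 \frac{s^4}{1\cdot 2\cdot 3\cdot 4} + \left(\frac{1}{R}\right)^5 \frac{s^6}{1\cdot 2\cdot 3\cdot 4\cdot 5\cdot 6} - \cdots.
\end{aligned} \tag{180}
\]

That is,

\[
\begin{aligned}
\sin\theta &= \theta - \frac{\theta^3}{1\cdot 2\cdot 3} + \frac{\theta^5}{1\cdot 2\cdot 3\cdot 4\cdot 5} - \frac{\theta^7}{1\cdot 2\cdot 3\cdot 4\cdot 5\cdot 6\cdot 7} + \cdots, \\
\mathrm{vers}\,\theta &= \frac{\theta^2}{1\cdot 2} - \frac{\theta^4}{1\cdot 2\cdot 3\cdot 4} + \frac{\theta^6}{1\cdot 2\cdot 3\cdot 4\cdot 5\cdot 6} - \cdots.
\end{aligned} \tag{181}
\]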
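The alternation between (169) and (170) can be mimicked symbolically. The sketch below is a modern paraphrase, not the procedure of the text itself: it assumes that, for very large n, a sum of the pinda-jyas multiplied by the arc-bit behaves like an integral over the arc, sets R = 1, and represents each approximation as a polynomial in $\theta = s/R$ with exact rational coefficients. The names `integrate` and `madhava_iteration` are hypothetical.

```python
from fractions import Fraction

def integrate(poly):
    """Integrate a polynomial {power: coefficient} from 0 to the variable."""
    return {p + 1: c / (p + 1) for p, c in poly.items()}

def madhava_iteration(rounds):
    """Return (jya, sara) as polynomials in theta after `rounds` corrections.

    Assumes R = 1 and replaces the large-n sums in (169) and (170) by integrals.
    """
    jya = {1: Fraction(1)}        # first approximation: the Rsine equals the arc
    sara = {}
    for _ in range(rounds):
        sara = integrate(jya)                  # continuum analogue of (169)
        double = integrate(integrate(jya))     # repeated sum appearing in (170)
        jya = {1: Fraction(1)}
        for power, coeff in double.items():
            jya[power] = jya.get(power, Fraction(0)) - coeff
    return jya, sara

if __name__ == "__main__":
    jya, sara = madhava_iteration(3)
    print("jya :", sorted(jya.items()))
    print("sara:", sorted(sara.items()))
```

Each round reproduces one more term of each series: one round gives the coefficients of (173) and (175), two rounds give those of (178) and (179), and further rounds continue the pattern of (181).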
17. Instantaneous velocity and derivatives


As we saw in Section 6.1, the mandaphala or the equation of centre for a planet,
$\Delta\mu$, is given by

\[
R\sin(\Delta\mu) = \left(\frac{r_0}{R}\right) R\sin(M - \alpha), \tag{182}
\]

where $r_0$ is the mean epicycle radius, M is the mean longitude of the planet and $\alpha$
the longitude of the apogee. Further, as we noted earlier, Munjala, Aryabhata II
and Bhaskara II used the approximation

\[
R\sin(\Delta\mu) \approx \Delta\mu, \tag{183}
\]

in (182) and obtained the following expression as correction to the instantaneous
velocity of the planet:

\[
\frac{d}{dt}(\Delta\mu) = \left(\frac{r_0}{R}\right) R\cos(M - \alpha)\,\frac{d}{dt}(M - \alpha). \tag{184}
\]

Actually the instantaneous velocity of the planet has to be evaluated from the more
accurate relation

\[
\Delta\mu = R\sin^{-1}\left[\left(\frac{r_0}{R}\right) R\sin(M - \alpha)\right]. \tag{185}
\]

The correct expression for the instantaneous velocity, which involves the derivative
of the arc-sine function, has been given by Nilakantha in his Tantrasangraha.⁹⁶


Let the product of the kotiphala [$r_0\cos(M - \alpha)$] in minutes and the daily motion
of the manda-kendra ($\frac{d(M - \alpha)}{dt}$) be divided by the square root of the square of
the bahuphala subtracted from the square of trijya ($\sqrt{R^2 - r_0^2\sin^2(M - \alpha)}$).


The result thus obtained has to be subtracted from the daily motion of the Moon 
if the manda-kendra lies within six signs beginning from Mrga and added if it 


96 Tantrasangraha, cited above (fn. 52), verses 2.53-54, pp. 169-170. Elsewhere, Nilakantha
has ascribed these verses to his teacher Damodara (Jyotirmimamsa, Ed. by K. V. Sarma, VVRI, 
Hoshiarpur 1977, p. 40). 
lies within six signs beginning from Karkataka. The result gives a more accurate
value of the Moon's angular velocity. In fact, the procedure for finding the
instantaneous velocity of the Sun is also the same. 


If $(M - \alpha)$ be the manda-kendra, then the content of the above verse can be
expressed as

\[
\frac{d}{dt}\left[\sin^{-1}\left(\frac{r_0}{R}\sin(M - \alpha)\right)\right]
  = \frac{r_0\cos(M - \alpha)\,\dfrac{d(M - \alpha)}{dt}}{\sqrt{R^2 - r_0^2\sin^2(M - \alpha)}}. \tag{186}
\]


The instantaneous velocity of the planet is given by 


\[
\frac{d}{dt}\mu = \frac{d}{dt}(M - \alpha)
  - \frac{r_0\cos(M - \alpha)\,\dfrac{d(M - \alpha)}{dt}}{\sqrt{R^2 - r_0^2\sin^2(M - \alpha)}}. \tag{187}
\]


Here, the first term in the RHS represents the mean velocity of the planet and the 
second term the rate of change in the mandaphala given by (186). 
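The difference between the arc-sine derivative (186) and the earlier small-angle expression (184) can be checked numerically. The following is a minimal sketch, assuming angular units with R = 1 so that `ratio` stands for $r_0/R$ and `x` for the manda-kendra $(M - \alpha)$ in radians; the function names and sample values are hypothetical and are not taken from any of the texts cited.

```python
import math

def mandaphala(x, ratio):
    """Equation of centre as an angle, arcsin(ratio * sin x), following (185)."""
    return math.asin(ratio * math.sin(x))

def rate_nilakantha(x, ratio, dx_dt=1.0):
    """Rate of change of the mandaphala following (186), with R = 1."""
    return ratio * math.cos(x) * dx_dt / math.sqrt(1.0 - (ratio * math.sin(x)) ** 2)

def rate_small_angle(x, ratio, dx_dt=1.0):
    """Rate of change following the earlier approximation (184), with R = 1."""
    return ratio * math.cos(x) * dx_dt

def numerical_rate(x, ratio, step=1e-6):
    """Central-difference derivative of the mandaphala with respect to x."""
    return (mandaphala(x + step, ratio) - mandaphala(x - step, ratio)) / (2.0 * step)

if __name__ == "__main__":
    x, ratio = 1.0, 0.3   # illustrative values only
    print(numerical_rate(x, ratio), rate_nilakantha(x, ratio), rate_small_angle(x, ratio))
```

With these sample values the central difference agrees with (186) to many decimal places, while (184) differs from it by the factor $\sqrt{1 - (r_0/R)^2\sin^2(M - \alpha)}$ in the denominator.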

In his Aryabhatiya-bhasya, Nilakantha explains how his result is more correct
than the traditional result of Muñjāla and Bhaskaracarya:⁹⁷


[Sanskrit passage in Devanagari; illegible in the scanned source]


Hence, how can the results be equal? ... Again the distinction being: there it was
prescribed that the multiplier koti-jya was to be divided by trijya, [but] here
it has been prescribed that the product of kotiphala and the rate of change of
kendra be divided by koti of the dohphala (dohphalakotya).⁹⁸


97 Aryabhatiya of Aryabhata, Ed. with Bhasya of Nilakantha Somayaji by K. Sambasiva
Sastri, Trivandrum Sanskrit Series 110, Trivandrum 1931, comm. on Kalakriyapada 22-25, pp. 62-63.

98 The terms dohphala and kotiphala refer to $\frac{r_0}{R}\sin(M - \alpha)$ and $\frac{r_0}{R}\cos(M - \alpha)$ respectively.
Hence, the term dohphalakoti refers to $\sqrt{1 - \left(\frac{r_0}{R}\sin(M - \alpha)\right)^2}$.
17.1. Acyuta’s expression for instantaneous velocity involving the derivative
of ratio of two functions 


In the third chapter of his Sphutanirnayatantra, Acyuta Pisarati (c. 1550-1621),
a disciple of Jyesthadeva, discusses various results for the instantaneous
velocity of a planet depending on the form of equation of centre (manda-samskara).
He first presents the formula involving the derivative of arc-sine function given by
Nilakantha (in the name of (manda)-sphutagati) as follows:⁹⁹


[Sanskrit verse in Devanagari; illegible in the scanned source]


Acyuta also gives the formula for the instantaneous velocity of a planet if one
were to follow a different model proposed by Munjala for the equation of centre,
according to which mandaphala is given by

\[
\Delta\mu = \frac{\left(\dfrac{r_0}{R}\right)\sin(M - \alpha)}{1 - \left(\dfrac{r_0}{R}\right)\cos(M - \alpha)}, \tag{188}
\]

instead of (182), where $\Delta\mu$ is small. If one were to use this formula for
mandaphala for finding the true longitude of the planet, then it may be noted that the
instantaneous velocity will involve the derivative of the ratio of two functions both
varying with time. Taking note of this, Acyuta observes:¹⁰⁰


The procedure that was prescribed earlier is with reference to the School that 
conceives of the increase and decrease in the circumference of the manda-vrtta 
in accordance with the karna. With reference to the School that conceives of 
increase and decrease only according to the half [of it], now we prescribe the 
appropriate procedure to be adopted. 


Acyuta then proceeds to give the correct expression for the instantaneous velocity
of a planet in Munjala's model:¹⁰¹


99 Sphutanirnayatantra of Acyuta Pisarati, Ed. by K. V. Sarma, VVRI, Hoshiarpur 1974, 
p. 19. 

100 Ibid., p. 20.

101 Ibid., p. 21.
Having applied the kotiphala to trijya [positively or negatively depending upon
the mandakendra], let the square of the dohphala be divided by that. This may 
be added to or subtracted from the kotiphala depending on whether it is Mrgadi 
or Karkyadi. The product of this [result thus obtained] and the daily motion of 
the manda-kendra divided by the kotiphala and applied to trijyà will be the 
correction to the daily motion. 


Thus according to Acyuta, the correction to the mean velocity of a planet in 
order to obtain its instantaneous velocity is given by 


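\[
\frac{\left(\dfrac{r_0}{R}\right)\cos(M - \alpha) - \dfrac{\left(\dfrac{r_0}{R}\sin(M - \alpha)\right)^2}{1 - \dfrac{r_0}{R}\cos(M - \alpha)}}{1 - \dfrac{r_0}{R}\cos(M - \alpha)}\;\frac{d(M - \alpha)}{dt}, \tag{189}
\]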


which is nothing but the derivative of the expression given in (188). 
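That the correction is the derivative of a ratio of two time-varying functions can be checked numerically. The sketch below is a hedged illustration only: it takes the mandaphala of Munjala's model in the form $\frac{(r_0/R)\sin x}{1 - (r_0/R)\cos x}$ with $x = M - \alpha$, and applies the quotient rule with the signs that follow from that form. The sign convention, the function names and the sample values are assumptions made for this example, since the verse itself prescribes adding or subtracting according to the quadrant of the manda-kendra.

```python
import math

def munjala_mandaphala(x, ratio):
    """Mandaphala in Munjala's model, assumed here as ratio*sin x / (1 - ratio*cos x)."""
    return ratio * math.sin(x) / (1.0 - ratio * math.cos(x))

def acyuta_rate(x, ratio, dx_dt=1.0):
    """Quotient-rule derivative of the expression above, times dx/dt.

    kotiphala term: ratio*cos x; dohphala term: ratio*sin x;
    denom: kotiphala applied to trijya (with R = 1).
    """
    denom = 1.0 - ratio * math.cos(x)
    inner = ratio * math.cos(x) - (ratio * math.sin(x)) ** 2 / denom
    return inner / denom * dx_dt

def numerical_rate(x, ratio, step=1e-6):
    """Central-difference derivative of the mandaphala with respect to x."""
    return (munjala_mandaphala(x + step, ratio)
            - munjala_mandaphala(x - step, ratio)) / (2.0 * step)

if __name__ == "__main__":
    x, ratio = 0.8, 0.2   # illustrative values only
    print(numerical_rate(x, ratio), acyuta_rate(x, ratio))
```

The structure of `acyuta_rate` follows the verbal rule: the square of the dohphala is divided by the kotiphala applied to trijya, combined with the kotiphala, and the result is again divided by the kotiphala applied to trijya and multiplied by the daily motion of the manda-kendra.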
Notes on Yuktibhasa: Recursive
Methods in Indian Mathematics 


P. P. Divakaran 


1. Overview 
1.1. Yuktibhasa 


It has taken a long time for historians of mathematics to move from curiosity to 
uncertain admiration to well-informed scholarly recognition of the brilliance of the 
mathematicians/astronomers who lived and worked in Kerala (on the southwest 
coast of India) from the last quarter of the 14th century CE until the end of the 
16th. As the undertaking of bringing to critical attention the totality of their work 
gathers pace, one thing has already become clear: it can no longer be doubted 
that the high point of their mathematical achievements is the invention of calculus 
and its systematic and sophisticated development for application to trigonometric 
functions. Three personalities have emerged as key figures in this story. The first 
of course is the still shadowy Madhavan (Madhavan Emprantiri, or Madhava in 
Sanskritised form; throughout this article, I employ the Malayalam way of writing 
names from Kerala, with a terminal n), the founder of the school, who is credited 
by his followers with having created much of their strikingly original mathematics 
and whose creative genius pervades everything that they subsequently did. Then 
we have the polymath Nilakanthan (Nilakantha Somayaji), the pivotal link between 
Madhavan and the later generations and the author of a large body of surviving 
work, notably Tantrasamgraha (TS from now on). And the third is Jyesthadevan 
who wrote what can accurately be called the first textbook of calculus, Yuktibhasa 
(YB from now on). YB is a comprehensive account, in its last two chapters, of the 
fundamental principles of integral and differential calculus (in that natural order in 
the Kerala approach to calculus), as well as the relationship between them, and their 
use in the study of trigonometric functions. 

Until recently, TS has served as the urtext for students of the final flaring of 
the long tradition of Indian mathematics and astronomy that the Kerala school 
represents. It is a compendium of the new mathematics of Madhavan and of its use 
in astronomy, in a compact Sanskrit verse format, and it is precisely dated (1500 CE)
at the midpoint of the relevant period. Within a relatively short period of about half
a century after its writing, a detailed commentary appeared in two separate works
by Sankaran (Sankara Variyar), Yuktidipika and Laghuvivrti, providing details of
several of the constructional and computational rules of TS.¹ YB, though it advertises
itself in the opening sentence as following TS, is quite a different type of work. Rather
than being a commentary (vyakhya or bhasya), it is a stand-alone exposition of the
material covered in TS, segregating (unlike TS itself) the mathematics in a first part
and its use in the construction of an accurate planetary model in the second. The 
mathematical Part I is all that will concern us here and what the abbreviation YB will 
refer to. It is distinctive in other ways too: at a time when all scientific texts were 
composed in Sanskrit, often in terse verse, it was written in Malayalam prose which 
is anything but terse, it is self-contained, and it provides reasoned justifications 
(yukti, which I will use interchangeably with ‘proof’ from now on)² for all results
cited. In other words, YB gives us a portrait of the mathematical culture of its 
time as few other Indian texts of any period do. And since it also cites all earlier 
developments relevant to its concerns, it is an ideal vantage point from which to 
survey the entire landscape of Indian mathematical thought of which the work of 
the Kerala school forms the final summit. An English translation of YB by the late 
K. V. Sarma has recently been published [4], enabling scholars without Malayalam 
to finally get to know it almost at first hand.³ The present article has taken advantage
of its publication by assuming that the reader will have it to hand; I shall cite it 
frequently as [Sarma], along with the relevant section numbers, even though the 
translations of passages that often accompany the citations are my own. 


1 All three texts have been critically edited and published in the original Sanskrit in one volume by K.
V. Sarma (with an introduction in English) [1]. For detailed accounts in English of some of the questions 
that TS is concerned with, see individual articles in [2]. 

2 This is not to be taken to mean that the two terms refer to identical processes, either logically or,
given the very different approaches in the Indian and European traditions to the enterprise of acquiring 
and validating knowledge, epistemologically. It will not be misleading to think of yukti as proof by other 
logical means than the now-universally practised sequence of steps starting with axioms. This point has 
been the subject of much impassioned discussion recently. The reader will find a detached evaluation of 
the two standpoints in the writings of Roddam Narasimha, especially [3]. 

3 For those who read Malayalam, there is the admirable and invaluable edition (of Part I) of Rama
Varma Tampuran and Akhilesvara Ayyar [5] (cited often in the rest of this article as [TA]) with diagrams, 
annotations and explanations of the more obscure parts of the text, now 60 years old and out of print; 
every subsequent piece of writing on YB, without exception, owes a debt of gratitude to this meticulous 
work. In English, Sarasvati Amma's book [6] has an excellent account of the details of the calculus work 
of the Kerala school (among many other topics), largely faithful. Sarma has chosen to title his translation 
as Ganita-Yukti-Bhasa, a choice whose motivation is something of a puzzle: almost all contemporary
references to the work and all known copies, whether on palm leaf or, relatively recently, on paper, use 
the simple title Yuktibhasa. This can be doubly confusing since there is a Sanskrit work under the name 
Ganitayuktibhasa which, according to Sarma himself (in the introduction to his own translation of YB), 
is a second rate rendering of the Malayalam work by an incompetent. 
Indian mathematical reasoning is as striking in its avoidance of certain techniques,
for instance reductio ad absurdum methods, as for its partiality to certain
others. This article is about what we can learn from YB and its recapitulations of
earlier material about one such widely favoured general procedure, that of recursion.
Three broadly distinct manifestations of the general idea of recursion (for the
sense(s) in which I use this term, see below, section 1.7)⁴ find significant use in YB:
recursive description, recursive computation and construction and recursive proof.
The term recursive description does duty also for what in modern logic will be called
recursive definition; as is well known, Indian thought, and certainly Indian mathematics,
did not much care for ab initio definitions. Moreover, it is not an infrequent
experience while going through YB to come across recursive characterisations of
mathematical objects, the coefficients in a power series for example, arising in the
course of a computation. This, and the fact that Indian proofs are largely constructive,
make it not always easy to disentangle description, computation/construction
and proof in a given piece of writing.

Recursive ideas run through Indian mathematics from before the time of
Aryabhata, notably in the perfecting of the decimal place-value notation for the
natural numbers. But it is in the intricate reasoning that goes into the development
of calculus that they finally find their most sophisticated expression, their natural
home as it were. This has a reasonable explanation. Recursive processes are, generally
speaking, nonterminating as is the infinitude of numbers; indeed, in the context
of Sanskrit linguistics, it was recognised very early, presumably by Panini (5th C.
BCE) himself and certainly by his commentator Patanjali (2nd-1st C. BCE), that
a science of language based on a finite set of recursive rules applied to a finite set
of linguistic units has the capacity to describe its structure in all its unbounded
richness. Patanjali’s graphic way of illustrating this principle is worth quoting (see
Staal [7], p. 40):


Now if grammatical expressions are taught, must this be done by the recitation 
of each particular word for the understanding of grammatical expressions ... ? 
No, says the author, this recitation of each particular word is not a means for the 
understanding of grammatical expressions. For we have a tradition which describes 
how Brhaspati addressed Indra during a thousand divine years going over the 
grammatical expressions by enumerating each particular word, and still did not 
attain the end. With Brhaspati as the professor, Indra as the student and a thousand 
divine years as the period of study, the end could not be attained ... The recitation of
each particular word, therefore, is not a means for the understanding of grammatical 
expressions. How, then, must grammatical expressions be understood? Some work 
containing general and particular rules must be composed. 




In the realm of mathematics, the discipline of calculus cannot, by the nature 
of its very content, but deal with the infinite and its opposite, the infinitely small. 


4 Numbers in boldface refer to the sections and subsections of the present article.
It is no surprise then to find that YB accommodates and utilises infinitely iterated
recursions in more than one role. But, as will become evident by the end of this
article, not all of the recursive methods it calls upon are in response to the demands
of the infinitesimal geometry which is the particular avatar that calculus takes in
YB, nor do they all lead to infinite processes. What they do have in common is the
constant presence of a certain spirit, that of the decimal place-value number system.
It is for this reason that the next section 2 is devoted to a retracing of this frequently
traversed path, ahead of the technically more demanding mathematical aspects in
the subsequent sections. In the present section 1, which is an overview of the entire
material, I have tried to avoid overly technical matters to the extent possible so as to 
give a reader who is not inclined to delve into mathematical details a general idea 
of the mode of thinking that animated Indian mathematics. 


1.2. Gestures and Sounds, Ritual and Language 


Going back as far as records and reliable tradition allow, three easily identified land- 
marks in the increasingly productive uses of recursive methods in Indian thought 
can be noted. The first began, probably before the earliest vedic works were com- 
posed, with the rules that govern all vedic rituals and effectively culminated in 
Panini's systematisation of the grammar and syntax of the Sanskrit of his time. 
An extensive ritual such as a yaga performed over several days is built up from its 
component elements in several stages of organisation and sequencing, the whole 
process controlled by strict rules which specify how subunits of the ritual, at various 
levels of organisation, may be combined, repeated, substituted, inserted one into 
another, etc., i.e., how they may be operated upon. That the number of repetitions 
of an operation is also specified we may take to be a performative limitation, not 
one of principle. Very similar rule-based structures are imposed also on the artic- 
ulation of sounds (mantra) that accompany the gestures and actions of the ritual. 
It would seem then that a ritual is fundamentally a ‘syntactically’ defined object, 
perhaps serving certain functional ends, its ‘semantic’ significance being secondary;
in other words, we are talking about “rules (largely) without meaning”. One may 
even argue, more generally, that divorcing sense from symbol bestows freedom; that 
attaching meaning to each component part of a recursively constructed object or 
process, as opposed to an overall purpose, might well hinder the realisation, through 
a process of abstraction, of the full power and flexibility afforded by rules without 
meaning. Nothing illustrates the power of rule-based abstraction more dramatically 
than modern mathematics.


5 It will be evident to the informed reader that my understanding of Indian rule-based thinking and
its impact on mathematics has been strongly influenced by the work of Frits Staal. 

6 A faithful and comprehensive record of an actual performance of a complex ritual is available in
Staal's magnificent work Agni [8]. 
As for Panini, the quotation above from Patanjali says it all, or virtually all,
about the indispensability of having a finite set of rules which may be applied 
over and over again without bound as the only means of imposing structure and 
regularity on a natural language. It needs to be said that Panini's principles cover all 
aspects of Sanskrit as a language-in-use, their power and depth already evident at 
the elementary level of the small collection of sandhi rules which describe/prescribe 
the way words (primarily their sounds; meaning is hardly relevant) are combined 
and in the process transformed. It is possible, though not of great significance for 
us, that these and other rules were a conscious adaptation, from the ritual syntax that 
apparently pays little heed to meaning, to the setting of a natural language which 
cannot possibly dispense with meaning at a fairly low level of organisation, that 
of a sensible word. The other difference is that, as Patanjali's fable highlights, a 
language does not have any finiteness limitation. 


1.3. One Thing after Another... 


The second landmark is without doubt the adoption of the place-value system 
with 10 as base for the naming, writing and manipulation of integral numbers. 
It is fairly certain that this entailed a long drawn out evolution, beginning with 
the first mentions of number names in the earliest texts⁷ and arriving at a mature,
economical and consistent model for numbers by the time of Brahmagupta (7th 
C. CE). Along the way, all the features that mark what the Arab mathematicians 
called the Hindu numbers became incorporated: the role of zero both as a numeral, 
the one preceding 1, and as the cardinality of the empty set as in the place-value 
notation; the systematic ordering and naming of numbers going up to values beyond 
any practical utility, making the idea of the infinite almost palpable as one might 
say; fractions and the rules for operating with them; negative numbers and their 
arithmetic properties, and so on. It is this finished product that was transmitted 
to Europe from the Arab world, beginning in the 12th-13th century, by Leonardo 
of Pisa (Fibonacci) and possibly others. The fascination that the ‘discovery of the 
zero' has held for European historians has perhaps tended to turn our attention away 
from the role of these other elements in making the decimal system the universally 
adopted basis for human numeracy. Things are beginning to change now. (For the 
early history of the zero, see Staal [10] and for the impact that a proper understanding 
and use of negatives had on Indian arithmetic and geometry, see Mumford [11], 
both in this volume). 


7 A succinct historical account will be found in the book of Datta and Singh [9] (Vol. I). Citing
instances from the Rgveda in the well known Vedic Index of Macdonell and Keith, it says on page 9: 
“We can definitely say that from the very earliest known times, ten has formed the basis of numeration 
in India”. 
It is often assumed that the idea of a place-value number system came to India
from the Babylonian sexagesimal numbers. The reasons for the belief are, firstly, 
that the sexagesimal system had attained a degree of completion by about 1800 BCE, 
some centuries before the first vedic mentions of numbers in the Rgveda. Then, the 
decimal notation itself is foreshadowed in the Babylonian use of a single character 
for 10 in the writing of sexagesimal numbers. Finally there is the prevalence in 
Indian astronomy of successive divisions by 60 in measuring time as well as the 
circumference of the circle. There may well be an element of validity in each of 
these reasons but, given our poor knowledge of the relevant history, none can be 
taken to be a decisive argument. For instance, the astronomical use of 60 can only 
be dated back to Aryabhata (born 476 CE) who, on the time scale of interest here, 
is a late figure. It is perhaps not an accident that the advent of 60 in the study of the 
(celestial) sphere and the measurement of time coincides with the sudden blooming 
of astronomy associated with him, plausibly through contact with the Alexandria 
of Ptolemy, and had no influence on a pre-existing decimal system — nothing non- 
astronomical has ever been counted in units of 60 in India. The prudent conclusion 
has to be that we know too little about the historical factors bearing on the issue 
to take a position, keeping in mind the risk of mistaking causality (the principle 
that says that the past cannot be influenced by, but may influence, the present) for 
causation. I shall return to the question of the uncertain history later (section 2.4). 

Whatever its antecedents, the decimal system is the first purely mathematical ex- 
pression of the power of recursive construction in the Indian context. Its significance 
for all mathematics, indeed for virtually all human activity, and for all time to come, 
can only be described as seminal. Even though it took some four centuries after its 
introduction by Fibonacci for it to definitively displace the laborious methods of 
doing arithmetic practised by European savants, the speed of its subsequent spread 
was rapid. By the end of the 18th century, Laplace was writing enthusiastically about 
it, as in the oft-quoted tribute: "The ingenious method of expressing every possible 
number using a set of ten symbols (each symbol having a place value and an absolute 
value) ... seems so simple nowadays that its significance and profound importance
are no longer appreciated” and, further, “The idea of expressing all quantities by 
nine figures whereby is imparted to them both a position and an absolute value is 
so simple that this very simplicity is the reason for our not being sufficiently aware 
how much admiration it deserves". An even more significant tribute to its power, 
not so much in the eloquence of his words as in the remarkable use he put it to, came 


8 Post-revolutionary France officially adopted (in 1799) decimally ordered (metric) units for length
and mass mainly as a result of the advocacy of Laplace (along with Lagrange). The sexagesimal division 
of time was obviously too well entrenched to be meddled with. The opposite happened in India much 
earlier: the adoption of the new sexagesimal time divisions in the 5th century CE barely touched the 
decimally organised counting numbers dating back to the Rgveda or the possibly equally ancient but 
less systematic multiples and subdivisions of lengths and weights. 
a century earlier from Newton. Understanding that use, in calculations with power
series, needs the context to be prepared and I will come to that in a while. 

What the decimal system first does in practice is to take a notion of a (countable) 
quantity, which humans apparently have an innate faculty for when the numbers 
are small but becomes uncertain and hard to manage as they increase, and give
it the same capacity for precision no matter how large the numbers are. That this 
problem was a preoccupation is clear from a number of early vedic passages. Some 
of them refer to situations in which numbers are matched or paired off without 
their magnitudes being specified, implying an understanding of when two numbers 
are equal even if each is not known ‘absolutely’, which is a plausible starting 
point in trying to quantify a discrete set. Then we have the verse from the Rgveda 
quoted by Staal in [12] and first highlighted by Renou, about the fire god Agni 
being the only being capable of apprehending precisely, we are not told how, large 
numbers. Over a period of time, but metaphorically at one stroke, the new way of 
counting gave people (and gods, not counting, naturally, Agni), the means to deal 
with arbitrarily large numbers with the same facility with which they dealt with 
small numbers. (That decimal counting did not completely supplant comparison 
counting at one stroke may be the message of the well-known passage from the 
Yajurveda in which a list of powers of 10 goes hand in hand with a pairing off 
of cows and sacrificial bricks: for the full passage, see for instance [13], p.131). 
One might say, from a modern viewpoint, that it replaces an abstract recursive 
definition — no one does any arithmetic starting from the Dedekind-Peano axioms — 
by a recursive constructive procedure of limitless power. Not the least significant 
aspect of this taming of numbers is that it goes hand in hand with their naming, 
by resorting to a relatively small collection of simple proper names for the atomic 
numbers⁹ 1 to 9 as well as for 10 and its powers, supplemented, ideally, by two 
clearly differentiated rules for combining these names, one for addition to and the 
other for multiplication of powers of 10. And to name is to bring into being, to 
make exist. The wonder of this achievement is still alive in the pages of YB: in the 
very first section, after quoting a verse from Bhaskara II on the names of powers 
of 10 up to pararddham ($10^{17}$), there occurs the statement, “If we endow numbers 
with multiplication and positional variation (sthanabhedam), there is no end to the 
names of numbers; hence we cannot know [all] the numbers themselves and their 
order” [Sarma 1.4].¹⁰ The rest of the opening chapter is then devoted to a concise 
but complete account of the basic arithmetical operations on decimally written 
numbers. 
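
The constructive procedure described above can be sketched in a few lines. This is only an illustration in modern terms, not a rendering of any procedure in YB: the function names `place_value_terms` and `recompose` are my own, and the output pairs each atomic number with the power of the base it multiplies, which is the information a name built by the two combining rules would carry.

```python
def place_value_terms(n: int, base: int = 10) -> list[tuple[int, int]]:
    """Split n into (atomic number, power) pairs, highest power first.

    Places holding zero are skipped, as they would be in a spoken name.
    """
    if n < 0:
        raise ValueError("n must be a non-negative integer")
    terms = []
    power = 0
    while n > 0:
        n, atom = divmod(n, base)  # peel off the lowest place
        if atom:
            terms.append((atom, power))
        power += 1
    return terms[::-1]


def recompose(terms: list[tuple[int, int]], base: int = 10) -> int:
    """Multiply each atomic number by its power of the base and add up."""
    return sum(atom * base**power for atom, power in terms)


if __name__ == "__main__":
    terms = place_value_terms(4075)
    print(terms)             # pairs of (atomic number, power of 10)
    print(recompose(terms))  # recovers the original number
```

The same loop works unchanged however large the number is, which is the point of the passage: the procedure does not depend on the size of its input.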


⁹ More generally, in a place-value system with base b, I use this term for the numbers 0 to b − 1, 
mindful of the association of the term ‘digit’ with the base 10. 


¹⁰ My translation; Sarma's version deviates apparently slightly but, for our purpose, crucially. 
Throughout this article, all unattributed translations are mine. The Sanskrit-Malayalam words and 
phrases cited are spelt as they are in YB and in Malayalam generally. 
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1.4. The Infinite and the Infinitely Small; Calculus 


Georges Ifrah's book [14] has a table giving the names of powers of 10 which 
goes up to 10?!. While it is difficult to be extremely precise about the antiquity 
of the very many lists of names of powers of 10 made by Hindus, Buddhists and 
Jainas alike (see for instance [9] or Hayashi's book on the Bakhshali manuscript 
[15]), all of them probably antedate the 3rd century CE or so¹¹ by which time 
sectarian Buddhist texts were being written in prolific quantity, many containing 
cosmogonic speculations or stories about the omniscience of the Buddha involving 
large, enormously large, numbers. (If anything, the Jainas were even more number- 
happy). For a while, it would appear that such semi-quantitative ideas of the very 
large coexisted with qualitative notions of the infinite as expressed in the oft-quoted 
upanisadic passage concerning the concept of pūrnam (full or complete) as the 
quality of being unchanged when something is added or taken away, or in the 
early Buddhist canonical references to the Buddha's agnosticism about the spatial 
and temporal boundedness or otherwise of the universe. The number lists are just a 
witness to the dawning realisation that there was now a bridge to infinity, constructed 
span by decimal span; a bridge without end, but one that would carry them as far as 
they wished to go. 

By the time of Bhaskara II (12th century CE), Indian mathematicians, most of whom 
were also astronomers, no longer felt the need to glory in their mastery 
of astronomical numbers. The list of numbers in his Līlāvatī stops at parārdha 
(which is now $10^{17}$) and for YB, $10^{17}$ is a perfectly acceptable point at which to 
halt in the interminable journey and do the rest of it in the imagination: YB leaves 
no doubt that it understands the resort to a fixed large number, as a substitute 
for an infinitely large one, to be a matter of practical convenience and ease of 
explanation.¹² 

At the other extreme, the infinitely small caused no difficulty at all, once the ap- 
proach to infinity itself was brought under control: as a number was taken to larger 


¹¹ Some of the more modest lists are in Hindu texts that are very much earlier: the Yajurveda has a 
list [9] going up to parārdha which, if the lists name all the consecutive powers of 10, stood at that time 
for $10^{12}$. There are in fact many lists of varying antiquity and provenance [14, 15] and they do not all 
agree on all the names. 

¹² The first ‘main theorem’ of the calculus part of YB is the infinite series for $\pi$. The derivation of 
the series proceeds, as is well known now, by dividing half the side of the circumscribing square of the 
unit circle (= $\tan \pi/4$) into n segments and letting n tend to infinity (see further down in this subsection). 
In the account of this procedure, there occurs a passage which says in part, “The smaller the segment, 
the more accurate the result. Therefore imagine that each side is cut into atoms and then carry out the 
summation. For this, if the division is by pararddham . . . , also divide by pararddham”. It then goes on 
to say that to get the exact value of the integral (samkalitam), the division has to be endlessly fine. For a 
translation of the full passage, see the section entitled “Integrals of Powers” of my article [16] or section 
6.4.1 of [Sarma]. 
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and larger values, its reciprocal came closer and closer to zero. In a general sense, di- 
vision (at least by a non-zero number) and the concomitant fractional numbers seem 
to have posed no conceptual problems in India from an early time, just as subtraction 
and negative numbers did not. Even in YB, the operation of division is dismissed in 
a very brief paragraph after an explanation in detail of several different ways of car- 
rying out multiplication. Similarly, the chapter on fractions is only concerned with 
the mundane computational aspects of extending the basic operations on integers to 
fractions — mostly about finding a common denominator. When these operations are 
subsequently applied to geometric quantities, we see the same pragmatism in play. 
A segment of arbitrarily small size of a straight line or an arc was arrived at simply by 
cutting it into a number n of equal pieces and then letting n increase without bound. 
We might say that the only limit that explicitly occurs is the limit $n \to \infty$, never the 
Newtonian limit $x \to 0$; no philosophical energy needed to be spent on defining or 
describing ab initio the notion of an infinitesimal. The contrast with the European 
story of calculus, where ‘local’ questions such as the problem of tangents (finding a 
sequence of chords to a curve that approached the tangent at a point) and of extrema 
(determining the points at which a function or its curve attained its local maxima 
and minima) were from the beginning in focus, is clear enough to need no further 
comment. 

The derivation of the series for $\pi$ is a good illustration of this approach.¹³ 
The problem is that of finding the length of (or rectifying) an arc of the circle 
(of unit radius) equal to one-eighth of its circumference, i.e., subtending an angle 
$\theta = \pi/4$ at the centre, and it is in its solution that the numerical series for $\pi$ and 
its generalisation, the arctangent series (described later in section 3), first saw the 
light of day. Schematically, the method consists in cutting up the unit tangent to 
the circle at any point ($\tan \pi/4 = 1$) into n equal parts, approximating the 
arc-segment corresponding to each tangent-segment by its chord (more accurately, 
by half the chord of twice the arc, a little twist that may justly be considered the 
founding step of Aryabhata's trigonometry), adding them up and finally letting n 
grow large. Right at the start of the description, there is the remark, “The greater 
the number [i.e., n], the more accurate the circumference”. (YB treats the case of 
the general arctangent series later by means of an elementary property of similar 
triangles; here I do not distinguish between the general and the particular cases). 
So it is clear that the procedure was effectively understood to be what we will 
consider today a specially simple instance of the use of the fundamental theorem of 
calculus: to determine a function explicitly, first differentiate it and then integrate 
the derivative. But the procedure did not, as it did not later for Gregory and Leibniz, 


¹³ Most of the rest of this section is, unavoidably, about the actual mathematical content of YB. 
Though I have kept equations at bay, it will be useful to have in hand an account of the mathematics 
as given for instance in the supplement of explanatory notes [17] by Ramasubramanian, Srinivas and 
Sriram (abbreviated from now on to [RSS]) in [Sarma]; better still, come back here after reading the rest 
of the present article. 
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give a closed expression for the circumference. A variety of recursive tools are now 
put to work. First, each term in the sum over the segments, before the large n limit 
is taken, is itself turned into an infinite series by means of an elementary recursive 
identity, a result which continues to hold in the limit. (The recursive identity will 
get its fair share of attention in the following sections). Though there is nothing 
said about the possibility of the series not converging at this point (it is dealt with 
subsequently), the student/reader is left in no doubt that the sum of the whole series 
is the exact length of the arc which one may then approximate by truncation at 
any point. In this step, the debt that the free use of the (actually) infinite series 
owes to the (potentially) infinite series of powers of 10 which are the decimal 
representations of integers appears direct and undeniable. Over a century after the 
writing of YB, the story repeated itself in Europe in a different guise when Nicolaus 
Mercator and Isaac Newton first wrote down the logarithmic series and the binomial 
series. Newton felt the need to provide a rationale for his formal manipulation of 
infinite series from “the doctrine recently established [!] for decimal numbers” 
in a revealing passage from De Methodis Serierum et Fluxionum written in 1670- 
1671 (translated by Whiteside [18], vol. III, p.32) and it merits being quoted in 
extenso: 


Since the operations of computing in numbers and with variables are closely simi- 
lar — indeed there appears to be no difference between them except in the characters 
by which quantities are denoted, definitely in the one case, indefinitely so in the 
latter — I am amazed that it has occurred to no one (if you except N. Mercator 
with his quadrature of the hyperbola) to fit the doctrine recently established for 
decimal numbers in similar fashion to variables, especially since the way is then 
open to more striking consequences. For since this doctrine in species has the same 
relationship to Algebra that the doctrine in decimal numbers has to Arithmetic, its 
operations of Addition, Subtraction, Multiplication, Division and Root-extraction 
may easily be learnt from the latter's provided the readers to be skilled in each, 
both Arithmetic and Algebra, and appreciate the correspondence between deci- 
mal numbers and algebraic terms continued to infinity; namely that to each single 
place in a decimal sequence decreasing continually to the right there corresponds 
a unique term in a variable array ordered according to the sequence of the dimensions 
[meaning the powers¹⁴ of 10, ‘places’] of numbers or denominators [presumably, 
in fractions in decimal notation; Simon Stevin had already decimalised 
fractions] continued in uniform progression to infinity (as you will see done in the 
sequel). 


¹⁴ I am grateful to David Mumford who reminds me that this usage of the term ‘dimension’ derives 
from Greek geometry in which squares and cubes of lines referred to 2- and 3-dimensional spaces — 
Newton just extended the terminology to numbers and to higher powers. 

¹⁵ For the structural reasons that make this more than just an analogy, see my article [16]. At the time 
of its writing I was ignorant that Newton had actually been inspired by the decimal system in his use of 
power series just as was, I had suggested, Madhavan. 
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For Newton this was no mere contingent justification. Six years later, in the earlier 
of the two letters to Oldenburg for forwarding to Leibniz, the Epistola Prior ([19], 
Vol.I, p.32), he says: 


Fractions are reduced to infinite series by division; and radical quantities by ex- 
traction of the roots, by carrying out these operations in the symbols just as they 
are commonly carried out in decimal numbers. 


What Newton had in mind clearly is that a decimally represented integer can 
be thought of as the value of a polynomial when the ‘variable’ is fixed at 10: if 
$c_i$ is the entry in the $i$th place of a number $n$, then $n$ is the value of $\sum_i c_i x^i$ at 
$x = 10$. One has of course to keep in mind that the entries in a decimal number are 
restricted to the digits 0 to 9 and so the correspondence requires the carry-over rules 
to be imposed on them when two numbers are added or multiplied. Further down 
(section 5.1), we shall see how just this correspondence is utilised by YB to model 
the algebra of polynomials in one unknown (an arbitrary positive odd integer) on 
decimal arithmetic. 
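
The correspondence can be made concrete with a small sketch. It assumes nothing beyond what is said above: a number is treated as its list of place entries $c_i$ (lowest place first), two such lists are multiplied as polynomials, and the carry-over rule is imposed only at the end. The helper names `digits_of`, `poly_mul`, `carry` and `evaluate` are my own and purely illustrative.

```python
def digits_of(n: int, base: int = 10) -> list[int]:
    """Place entries c_0, c_1, ... of n, lowest place first."""
    digits = []
    while True:
        n, d = divmod(n, base)
        digits.append(d)
        if n == 0:
            return digits


def poly_mul(a: list[int], b: list[int]) -> list[int]:
    """Multiply two coefficient lists as polynomials, with no carrying."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out


def carry(coeffs: list[int], base: int = 10) -> list[int]:
    """Impose the carry-over rule so that every entry lies in 0..base-1."""
    out, c = [], 0
    for x in coeffs:
        c, d = divmod(x + c, base)
        out.append(d)
    while c:
        c, d = divmod(c, base)
        out.append(d)
    return out


def evaluate(coeffs: list[int], x: int = 10) -> int:
    """Value of the polynomial sum_i c_i x^i at the given x."""
    return sum(c * x**i for i, c in enumerate(coeffs))


if __name__ == "__main__":
    product = poly_mul(digits_of(47), digits_of(86))
    print(product)          # polynomial coefficients before carrying
    print(carry(product))   # decimal places of 47 * 86, lowest first
    print(evaluate(product) == 47 * 86)
```

Evaluating the uncarried product at 10 already gives the right number; carrying changes only the representation, which is why the polynomial algebra and the decimal arithmetic can be modelled on each other.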


1.5. A Recursive Proof 


The use of the recursive identity mentioned above is not an essential part of calculus 
per se. It is an auxiliary or enabling tool in much the same way that binomial series 
were for Newton. (Special cases of the infinite series generated by the identity in 
fact coincide with special cases of Newton's binomial series, one specific instance 
being the present case of the arctangent series). Its use leads to a representation of 
each term in the series as a finite sum which in turn will become an integral, the 
integral of a positive even power, in the limit of large n. These integrals are still to 
be evaluated and the next key step forward, that of a recursive proof, makes its first 
appearance here. 

In a daring step announced with little fanfare,¹⁶ YB recognises that the way 
to evaluate the even integrals is i) to consider the integrals of all nonnegative 
powers, both even and odd, and ii) to relate the value of the integral of a given 
(kth) power to that of the (k − 1)th power, i.e., an even (odd) term is obtained 
from the preceding odd (even) term. This is an astonishingly original thing to 
have done. Nothing that went before in Indian mathematics prepares us for this 
step, which should not be surprising since what the step encodes is the principle 
of integrating by parts or, equivalently, the rule for differentiating products. It 


¹⁶ YB has a very matter-of-fact and pedantic style; none of the light-hearted asides favoured by 
other writers, such as addressing questions and explanations to the slow-witted or the adept (Bhaskara 
II poses one problem to a “proud expert in algebra”, bije patutabhimanah), can be found in its 
pages. 
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is also a conscious application of the principle of induction, in itself a (if not 
the most) sophisticated manifestation of the power of recursive methods. What 
makes it work is that the recursive step is generic: the procedure that generates 
the kth term from the (k − 1)th term does not depend on k. The first step, that of 
considering all k, no matter how large, is thus forced by the desire for an exact 
answer and the resulting recursive nature of the proof. Or, perhaps, it is the other 
way: the impulse is the recursive mindset and it is that which makes an exact answer 
attainable. 
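
A small numerical sketch may help fix the idea of relating the kth power to the (k − 1)th. It is a modern paraphrase under my own assumptions, not a transcription of YB: the integral of the kth power is replaced by the finite sum of kth powers of 1 to n, and the link between consecutive powers is taken to be the elementary identity n·S(k−1, n) = S(k, n) + [S(k−1, 1) + ... + S(k−1, n−1)], where S(k, n) denotes that sum. The names `power_sum`, `sum_of_sums` and `scaled_power_sum` are hypothetical.

```python
from fractions import Fraction


def power_sum(k: int, n: int) -> int:
    """S(k, n) = 1^k + 2^k + ... + n^k."""
    return sum(i**k for i in range(1, n + 1))


def sum_of_sums(k: int, n: int) -> int:
    """S(k, 1) + S(k, 2) + ... + S(k, n - 1)."""
    return sum(power_sum(k, m) for m in range(1, n))


def identity_holds(k: int, n: int) -> bool:
    """Check n * S(k-1, n) == S(k, n) + sum of the earlier S(k-1, m)."""
    return n * power_sum(k - 1, n) == power_sum(k, n) + sum_of_sums(k - 1, n)


def scaled_power_sum(k: int, n: int) -> Fraction:
    """S(k, n) / n^(k+1), the finite stand-in for the integral of x^k on [0, 1]."""
    return Fraction(power_sum(k, n), n ** (k + 1))


if __name__ == "__main__":
    print(all(identity_holds(k, n) for k in range(1, 6) for n in range(1, 30)))
    for k in range(4):
        # The scaled sums approach 1/(k+1) as n grows.
        print(k, float(scaled_power_sum(k, 1000)), 1 / (k + 1))
```

The step from k − 1 to k has the same form for every k, which is what allows a single generic argument to cover all the powers at once.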

All of this is covered in chapter 6 of YB, which then concludes by treating a 
practical problem raised by (convergent) infinite recursion: how can one extract 
useful numbers from a procedure that is endless? If the successive terms in an 
infinite series diminish in value rapidly, one can, optimistically, be satisfied with 
truncating the series after a finite (small) number of terms, the number depending 
on the rate of convergence and on the accuracy demanded. The arctangent series 
converges notoriously slowly. YB has some quite remarkable ways of accelerating 
the convergence by a process of reordering the terms, achieved by the use of a 
method of recursive correction. It also poses the question of how one may estimate 
the remainder left out by a truncation at the ith term, i.e., the difference between 
the exact value (the sum of all the infinite number of terms) and the sum up to i 
terms. The impressive way this is done, no longer to our surprise by now, is again 
recursive in content but with an interesting modification. A first guess is made about 
its value, which is then fed into the equations satisfied by the remainder to generate 
a ‘truncation error’ whose value (sthaulyam, grossness) is used as a guide in fine- 
tuning the initial guess rather than as the input in the next iterative step. The reason 
is that this would-be recursion within a recursion becomes impossibly complicated 
to implement after the first stage. The step that is iterated is the one of going back 
to the first guess and changing it judiciously. Already in two or three such iterations 
very precise values for $\pi$ are produced. 
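
The effect of such corrections is easy to see numerically. The sketch below is hedged in two ways. First, it does not reproduce the guess-and-refine procedure just described; it only evaluates three correction terms in the forms usually cited in the secondary literature for the series 1 − 1/3 + 1/5 − ..., namely 1/(4n), n/(4n² + 1) and (n² + 1)/(4n³ + 5n) after n terms. Second, the function names and the choice n = 10 are my own.

```python
import math


def partial_sum(n: int) -> float:
    """First n terms of 1 - 1/3 + 1/5 - ... (which sums to pi/4)."""
    return sum((-1) ** j / (2 * j + 1) for j in range(n))


def correction_1(n: int) -> float:
    return 1 / (4 * n)


def correction_2(n: int) -> float:
    return n / (4 * n**2 + 1)


def correction_3(n: int) -> float:
    return (n**2 + 1) / (4 * n**3 + 5 * n)


def corrected(n: int, correction) -> float:
    """Partial sum plus the signed estimate of the neglected remainder."""
    return partial_sum(n) + (-1) ** n * correction(n)


if __name__ == "__main__":
    n = 10
    target = math.pi / 4
    print("no correction:", abs(partial_sum(n) - target))
    for f in (correction_1, correction_2, correction_3):
        print(f.__name__, abs(corrected(n, f) - target))
```

Each successive correction leaves a markedly smaller error for the same number of terms, which is the practical answer to the slow convergence noted above.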


1.6. Solving a Differential Equation 


The bulk of chapter 7 of YB is devoted to a problem that goes back to Aryabhata, 
that of expressing the sine of an arbitrary angle in terms of that angle or, in 
geometric language, given an arc, to find an expression for half the chord of twice the 
arc. In a famous (semantically nonsensical) verse (Gitikapada 10 of Aryabhatiya, 
abbreviated to AB), Aryabhata sets out the results of his computation in the form 
of a table of sines for 24 equally separated angles in the first quadrant, i.e., in 24 
steps of $\pi/48$. He himself in an opaquely worded stanza (Ganitapada 12 of AB) and 
his immediate successors in more transparent words, Sūryasiddhānta for example, 
have indicated the procedure used to make the table. With help from later works, in 
particular YB, the principle behind the construction can be described qualitatively 
as follows. 
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When the angle is close to zero, the value of its sine is close to the value of the angle 
if angles are measured, as is commonly done today, in natural units, namely radians. 
Equivalently, both the angle and the sine can be expressed in sexagesimal minutes as 
Aryabhata did. (This was a nontrivial matter; Ptolemy's astronomical calculations 
were made exceedingly involved by his partial use of degrees and minutes). To the 
accuracy that Aryabhata worked to, which was a minute of arc, $\pi/48$ was small 
enough for this to be true as could be verified by easy computation, for example 
starting with $\pi/6$ and halving it three times. Thus the first two entries in 
the sine table were filled. The next step required a notion of sine differences, namely 
the difference between the sine of a given angle in the table and its predecessor (this 
was what Aryabhata actually tabulated in his verse), and then, of their second 
differences, namely the difference between two successive such sine differences. 
Elementary circle-geometry then leads to an equation for the second difference. 
The recipe outlined in Sūryasiddhānta is most simply interpreted as arising from 
an approximate recursive solution of this difference equation, terminated after the 
first iteration, which is accurate to the required degree (or, one should rather say, to 
the minute), an interpretation well supported by the relevant portion of YB (section 
7.4.2 of [Sarma]).¹⁷ I am conscious of the inadequacy of this summary of one of the 
cleverest parts of early Indian mathematics but, once again, invite the reader to go 
to the technically complete account in section 4.2. 
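
As a rough illustration of what a recursion on second differences looks like, here is a sketch in modern notation. It is not the Sūryasiddhānta recipe, whose details are deferred to section 4.2. It assumes the standard trigonometric fact that the second difference of sin(nh) equals −(2 sin(h/2))² sin(nh), and offers, as my own simplification, a small-angle variant in which sin h is replaced by h and the coefficient by h². The name `sine_table` and the flag `exact` are hypothetical.

```python
import math


def sine_table(steps: int, h: float, exact: bool = True) -> list[float]:
    """Build sin(0), sin(h), ..., sin(steps * h) from second differences.

    exact=True  uses the exact coefficient (2 sin(h/2))^2 and first entry sin h.
    exact=False uses the small-angle stand-ins h^2 and h (an assumption made
                here for illustration only).
    """
    if exact:
        first = math.sin(h)
        k = (2.0 * math.sin(h / 2.0)) ** 2
    else:
        first = h
        k = h * h
    table = [0.0, first]
    for _ in range(steps - 1):
        previous, current = table[-2], table[-1]
        difference = current - previous      # latest sine difference
        second_difference = -k * current     # change in the sine difference
        table.append(current + difference + second_difference)
    return table


if __name__ == "__main__":
    h = math.pi / 48                 # the 24 steps of the first quadrant
    radius = 10800 / math.pi         # radius of the circle in minutes of arc
    exact = sine_table(24, h)
    rough = sine_table(24, h, exact=False)
    for n in (1, 8, 16, 24):
        print(n, radius * math.sin(n * h), radius * exact[n], radius * rough[n])
```

With the exact coefficient the recursion reproduces the sines to rounding error; with the small-angle stand-ins it drifts slowly as n grows, which gives some feeling for why the accuracy of a truncated recursive solution had to be checked against the precision demanded.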

The journey from Aryabhata's small angles and sine differences to the infinitely 
small angles and sine differentials of YB took close to a thousand years and also 
took some false turns.¹⁸ But the recognition that the equality of an angle and its sine 


¹⁷ A clear and early (1795) exposition in English of this material is that of the Scottish geometer and 
historian of geometry, John Playfair [20] (he of the parallel axiom but also, more pertinently for us, 
one who closely followed the work of H. T. Colebrooke on Indian science. Charles Whish, the first to 
write in English on some of the Kerala texts including YB, was also strongly influenced by Colebrooke). 
Only in one respect is Playfair slightly off the mark: he conjectures that "the brahmins" had in hand the 
geometrical equivalent (Ptolemy's theorem etc.) of the addition formulae for the sine and the cosine. 
That is unlikely; what they had was what they needed, the formulae for small increments (see section 
4.2). I am grateful to Roddam Narasimha for the Playfair reference and for making me appreciate its 
historical importance in the reconstruction of Aryabhata's sine table. 

¹⁸ Ganitapada 11 of AB is a strong indication that Aryabhata himself understood perfectly well that 
his procedure was an approximate one and that it could be made more accurate if desired by a finer 
division of the circumference. But, until we get to Kerala, his followers seem to have led themselves 
astray. In his bhāṣya of AB, Bhaskara I asks the intriguing question: does a circle have an arc equal to its 
own chord? and answers: yes, otherwise an iron ball resting on the floor will not be stable; such an arc will 
be (1/100)th (a fraction smaller than (1/96)th!) part of a circle (see [21] for instance). More interestingly, 
Bhaskara II arrived at the correct formulae for the surface area and volume of a sphere by dividing a 
great circle into 96 equal parts and effectively equating $2\pi/96 = \pi/48$ to its sine (see Sarasvati Amma's 
book [6] for the details). We have to conclude, on available evidence, that little conceptual progress 
was made in the time intervening between Aryabhata’s cryptically expressed realisation of the need for 
indefinitely continued subdivision (Ganitapada 11) and Madhavan’s full-fledged calculus of the circle. 
I hope to return to the question of the prehistory of Indian calculus elsewhere. 
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is only a limiting property of angles tending to zero, presumably by Madhavan, and 
its being put to use to transform Aryabhata’s approximate table into Madhavan’s 
exact series seem to have been almost simultaneous. There are two parts to this 
transformation. First, Aryabhata’s equation for the second differences was shown 
to become exact as the step size was taken towards zero becoming, automatically as 
it were, the second order differential equation satisfied by the sine (and the cosine) 
function — this is calculus pure and simple, exactly as we might proceed today. But 
the recursive solution of the equation now has to be carried through all the way, 
through an infinite number of iterations, to get an exact expression for the sine; 
that is precisely what YB does, resulting in the famous series in powers of the 
angle. 
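
How an infinitely iterated recursion produces a power series can be shown with a modern stand-in. The sketch assumes only that the sine satisfies the second order equation mentioned above, rewritten as sin θ = θ − (the double integral of the sine from 0), and it iterates this relation on polynomials with exact rational coefficients. YB works with sums of sums rather than integrals, so this is an analogy of my own making, and `integrate` and `iterate_sine` are hypothetical names.

```python
from fractions import Fraction


def integrate(poly: list[Fraction]) -> list[Fraction]:
    """Integrate a polynomial (coefficients, lowest power first) from 0."""
    return [Fraction(0)] + [c / (i + 1) for i, c in enumerate(poly)]


def iterate_sine(iterations: int) -> list[Fraction]:
    """Apply s -> theta - (double integral of s) the given number of times,
    starting from the small-angle guess s = theta."""
    theta = [Fraction(0), Fraction(1)]
    s = theta
    for _ in range(iterations):
        double_integral = integrate(integrate(s))
        s = [
            (theta[i] if i < len(theta) else Fraction(0)) - c
            for i, c in enumerate(double_integral)
        ]
    return s


if __name__ == "__main__":
    for power, coefficient in enumerate(iterate_sine(3)):
        if coefficient:
            print(f"theta^{power}: {coefficient}")
```

Each pass through the loop feeds the previous output back in and adds one more term of the series in odd powers of the angle; stopping after the first pass gives the finite approximation, while continuing without end gives the exact expression.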

There are several extremely ingenious steps involved in this progress from 
Aryabhata (finite differences and approximate numerical solutions) to Madhavan 
(differentials and exact analytic solutions), some of them wonderfully illustrative of 
the flexibility of recursive techniques — most remarkable is an instance of a doubly 
inductive proof which will get its deservedly full treatment later in section 4.4. 
But none of that should blind us to the revolutionary new way of doing mathe- 
matics that made it possible in the first place, the way of calculus that Madhavan 
pioneered. 


1.7. Recursion and Rules of Composition 


The following sections aim to provide the mathematical details of some of the more 
striking uses to which recursive methods are put in YB (as well as some historical 
background where relevant and available), after first taking a closer look at that 
model recursive object, the natural numbers as expressed in the place-value nota- 
tion. The approach will be almost entirely descriptive. There will be nothing here 
relating to the elaborate theoretical framework(s) constructed around the general 
idea of recursion in linguistics, logic, foundations of mathematics, computational 
theory and so on, the main reason for the reticence being lack of knowledge and 
competence. But it is not an unreasonable stance to adopt when dealing with In- 
dian mathematical thought which famously (or notoriously, depending on one’s 
epistemological leanings) declined to take categorical positions in regard to issues 
of definitions, axioms, rules of deduction, etc., etc. It is not that Indian men of 
learning were indifferent to the reliability or otherwise of their ways of establishing 
what is true — we have only to read Nilakanthan to know that methods of acquiring 
and validating knowledge, both empirical and theoretical, were a constant concern 
among, at least, astronomers and mathematicians. Nothing was self-evident to these 
inheritors of a tradition, going back two millennia or longer to the time of the later 
upanisads and the śramaṇa movement, of questioning and sceptical enquiry. Their 
position appears to have been that an excessive preoccupation with first principles, if 
there be such, was a distraction from the more important business of understanding 
the world as it is.¹⁹ They might have been intrigued but not surprised if they could 
have known that present-day mainstream scientific practice, after having explored 
the axiomatic methodology to its limits, has gradually been turning towards a more 
easy and pragmatic view of such foundational questions; that these are issues the 
practitioners leave to others to worry about or, at worst, think about on Friday 
afternoons. 

Having said this, this very spirit of realism should persuade us to try and evaluate 
the place of recursive methods, in the very broad sense in which we take them, as a 
productive means of creating interesting new mathematical knowledge. In his work 
on the structure of rituals (and, inevitably, its relation to the structure of language), 
Frits Staal ([7], p.42) characterises a recursive rule as one which “can be applied 
to its own output” and so “can produce infinitely many forms”. Intuitively, this is 
an appealing definition, general enough to embrace the instances that we are going 
to be concerned with. But when dealing with a developed mathematical science 
from our current perspective, it will need to be both sharpened and broadened; the 
emphasis on infinity, for instance, will have to be relaxed. It can well (and does) 
happen that, for any chosen initial input, the output after a certain finite number 
of steps coincides with the initial input or the input/output of a preceding step. An 
exact recursive solution of the exact difference equation for the sine will be a case 
in point: if the incremental angle is $2\pi/N$ for example as it was for Aryabhata, 
the output of the Nth step will be the same as the initial input because of the 
periodicity of the sine function. More particularly, such cyclic recursions will occur 
whenever the mathematical structure we are computing in is defined on a finite set 
(but finiteness is not a necessary condition). It would be pointless to go through the 
same cycle over and over again, learning nothing new, but it is not a serious matter 
in itself (except, perhaps, for a badly programmed computer which has not been 
told when to stop); the interest that a recursively generated result may hold is not 
necessarily in proportion to the number of steps needed to get to that result. 
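
A toy example shows what a cyclic recursion looks like and how a program can be told when to stop. It assumes nothing about YB: the rules used (multiplying by 3 modulo 7, squaring modulo 10) are arbitrary choices of mine on small finite sets, and `iterate_until_repeat` is a hypothetical helper.

```python
def iterate_until_repeat(rule, start):
    """Apply rule to its own output until a value recurs.

    Returns the values visited, in order, and the position in that list
    of the value to which the recursion returns.
    """
    seen = {}
    trail = []
    x = start
    while x not in seen:
        seen[x] = len(trail)
        trail.append(x)
        x = rule(x)
    return trail, seen[x]


if __name__ == "__main__":
    # Returns to the initial input after visiting every nonzero residue.
    print(iterate_until_repeat(lambda x: (3 * x) % 7, 1))
    # Returns to the output of a preceding step rather than to the start.
    print(iterate_until_repeat(lambda x: (x * x) % 10, 2))
```

On a finite set the loop always terminates, since some value must eventually recur; whether it comes back to the initial input or only to a later one depends on the rule.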

A more pertinent, broader, issue is that the manner of our response to modes of 
thinking from the past is necessarily subservient to a certain tyranny of progress. 
No historian can fully unlearn what he has learned of subsequent events, however 
conscious he may be of the imperative of placing himself in the epoch. In the case of 
recursive mathematical techniques, this psychological barrier is a particularly diffi- 
cult one to penetrate. The reason is simply that an essential part of the progress that 
tyrannises has consisted in reorienting mathematics towards an abstract structural 
mode of thinking even when the particular problem at hand may arise in a concrete 
context. Such structures are defined completely by rules governing the operations 


¹⁹ In some of his more philosophical writings, Nilakanthan gives us a clear idea of the epistemological 
moorings of his work and, presumably, that of others of the Kerala school; especially revealing are his 
remarks on the futility of seeking logical first causes. These writings have recently been critically 
examined by Roddam Narasimha in a series of articles which can be retraced from [3]. 
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that can be performed on the elements of a given (structured) set, in particular rules 
of composition associating to every pair of elements precisely one element of the 
same set. Since the rule of composition is defined on every pair of the set, it follows 
that it can be applied to its own outputs over and over again. More generally, a 
problem of interest may involve a collection of such structurally defined sets with 
rules governing ‘maps’ from one set of the collection into another (including com- 
position within one such set), which can themselves be successively applied subject 
to various conditions of compatibility. In short, recursive operations are embedded 
in the very genes of modern mathematics, their deployment routine and automatic; 
progress has turned the creative into the commonplace. But they do still catch the 
eye when a description or a proof depends critically and explicitly on the recursive 
properties of, say, composition rules. Instances are easy to come by. In the area of 
recursive proofs in particular, the very general technique of proof going under the 
name of mathematical induction is without doubt the prime example. Here let us 
only recall that inductive proofs proceed by establishing a correspondence between 
an infinite number of propositions and the recursively defined natural numbers. 

The abstract structural vision that animates much of modern mathematics was 
of course not even a gleam in the eye of Madhavan and the school he founded. Nev- 
ertheless, in assessing the special role that recursive reasoning played in their work, 
we have little choice but to use the same criterion that we would in a contemporary 
situation: how good is the mathematics they produced and could they have reached 
their mathematical destination by a path that was not the one they so consciously 
built, one step after another? Were there other more banal, ‘natural’, means available 
to them which would have done the job? The quality of the mathematics is by now 
self-evident. As for the method, I hope a reader who persists to the end will be 
convinced that the techniques they perfected and used so imaginatively and to such 
good effect were a natural continuation of the entire tradition of Indian mathematics 
and that there was no other way conceivable by them that would have taken them 
to their goal. It is in this perspective, as an account of the culmination of a long 
evolution, that this essay should be viewed. 


2. The Grammar of Numbers 
2.1. Seeing and Counting, Naming and Knowing 


Measurement and quantification are an essential part of the means by which rational 
humans have sought comprehension of the world they live in. The ability to quantify 
is a direct prior factor in our understanding of ‘nature’, the world in all its aspects; 
the interrogations “how many?” and “how much?” precede “how?” and “why?”.
And counting is the first, the most primitive, measurement there is. All possible 
answers to “how many?” are in principle definite and absolute. They do not need 
to be compared with externally prescribed standards for them to make sense in our
own minds or to be communicated to others: no balance, meter stick or clock is
required in the process of counting. 

But this is a hard-won autonomy it would seem. By all accounts, early humans 
had no problem with counting small numbers — they ‘took it in at a glance’. Staal 
[12] quotes Louis Renou to the effect that in the Rgveda the verbal root khyā, from
which comes samkhyā meaning ‘number’, signifies ‘to look’ and that it is only in
later vedic times that it acquired a numerical connotation. When it came to larger 
numbers, the early vedic people apparently (going by the cows-for-bricks passage) 
had to resort to comparison and correspondence, not with a fixed standard number, 
but with numbers chosen by the situation, a sort of ‘weighing’ of pairs of numbers 
in an abstract number-balance as it were.²⁰ That this was so is entirely in the nature
of things; comparison is still the only way we have of quantifying magnitudes such 
as length or time which do not come in multiples of an irreducible, universal unit. 
If the Planck length had a human scale, the physical world would be discrete in the 
large and all measurements would consist of counting; vedic geometry certainly, 
and perhaps even Greek geometry, would not have had to be invented. 

How large had a number to be for people to fail to ‘see’ it? From the historical 
evidence we have, we might guess, not beyond 10—even the Babylonian sexagesimal 
system had a single separate symbol for 10. 

We can take the passage from the Rgveda about Agni, “He has seen or counted
(people) like one observes groups of cattle with an owner of cattle" (Staal's trans- 
lation [12] from Renou's French) in two senses: that Agni could see ‘at a glance’ or 
count the number of people as he could the number of cattle, or that he could count 
while others merely saw. In either sense, Agni could count, presumably beyond ten. 
One must assume that, for those who could not, the capacity to match went with the 
capacity to distinguish between ‘more’ and ‘less’ — to draw a meaningful conclusion 
from observing which pan of the number-balance went down — and by how much, 
as long as the difference was, again, small. That would introduce a semiquantitative 
(because of the limitation to small numbers) notion of order among numbers but 
would still not bring a mortal the divine gift that Agni possessed. The way for- 
ward from here is the obvious recursive one of extending the ordering by repeated 
comparisons. It is difficult at this point not to jump ahead and note that it is this 
imagined primitive idea of order, limited in the beginning to nearby numbers and 
subsequently extended by recursion, that is captured by Peano in full quantitative
generality in the succession axiom. But although the Peano axioms give a logically 
complete characterisation of natural numbers, they still do not help directly either 


20 It is of course acknowledged that drawing conclusions about the evolution of ideas and practices
from early vedic sources is a hazardous business simply because of the strong possibility of a long 
time-gap between their actual composition and their consolidation in the samhita. Along with examples 
of primitive, comparison-based, enumeration the Rgveda, for instance, also has a rich repertoire of 
decimally designated number names.

to take in large numbers ‘intuitively’ or, in practical terms, to do arithmetic; to
implement this last step we are once again obliged to go back to the primitive idea 
of succession and build everything up from the operation of adding 1 at a time (see 
section 2.3 below). 

“There is no end to the names of numbers; hence we cannot know the numbers 
themselves and their order”. To know numbers and to use them, we have first to give 
them names, an identity that will assign each of them its exact place in the order of 
all numbers and that will let us know instantly what that exact place is. For it to serve 
its purpose the nomenclature has to be rule-based — in order to let mere mortals see 
where a number denoted by a name stands in the order — and economical — so as not 
to tax their finite powers of memory. We might well paraphrase Patañjali and say,
“How, then, must the meanings of numbers be understood? Some work containing 
general and particular rules must be composed". We might go further and formally 
prescribe the “general” rules as the rules designating a number N by the unique 
ordered sequence of atomic numbers (including 0) $n_0, n_1, \ldots$ which constitute its
place-value representation, namely the coefficients of the powers of the chosen base
$b$, not necessarily 10, in the sum

$$N = n_0 \times (b^0 = 1) + n_1 \times (b^1 = b) + n_2 \times (b^2 = b \times b) + \cdots,$$


expressed orally or in writing or both. In case the expression is verbal they need 
to include also the rules by which the number is given a verbal identity, which is 
accomplished by a set of names for atomic numerals juxtaposed in the correct order 
by use of joining or compounding rules applied at both the semantic (samasa) and 
the syllabic (sandhi) levels. (If the representation is written down in symbols as 
in Mesopotamia, names are strictly speaking not necessary, but those who wrote 
and read them probably talked to one another). Thus made tangible, numbers are 
the first, the most primitive, concept in Indian, perhaps all, mathematics.²¹ “That
which is the particular study of numbers relating to enumerables (samkhyeyam) is 
mathematics" (YB, introducing the place-value system and the rules of arithmetic). 
As for "particular" rules, the natural candidate is the collection of the laws of 
arithmetic worked out within the place-value representation, including the carry 
over or overflow rules. The particular rules cannot be given in a concise or even
practical way without the general rules having been established. How will one teach 


21 There is good evidence that numbers have priority over geometry in the vedas not only conceptually
but also chronologically. The Sulbasūtra manuals containing the earliest accounts of geometry in India
cannot be dated earlier than about 700 BCE while number names with a distinct decimal structure occur
freely already in the Rgveda. For all matters concerning the chronology of vedic times, I have relied on
the synoptic account of the current state of knowledge given in [13]. As for the earlier Harappan culture,
we can say little definite about their numbers (see footnote 23 and section 2.4 below), but a fair picture
of its geometric sophistication is now beginning to emerge [22]. It is, however, not ruled out that there
may be a degree of continuity between Harappan geometry and that described in the Sulba texts.

or learn, for instance, the multiplication table in Roman numerals except, as with
Brhaspati and Indra, by endless enumeration? 
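
The "general" rule described above, the unique sequence of coefficients of the powers of a chosen base, can be sketched in a few lines. This is only an illustration: the function names `to_places` and `from_places` are hypothetical, and the units-first ordering of the list is an assumption made to match the order of the sum in the text.

```python
def to_places(n, base=10):
    """Return the place-value coefficients [n_0, n_1, ...] of n, units first."""
    if n < 0 or base < 2:
        raise ValueError("need n >= 0 and base >= 2")
    places = []
    while True:
        # Peel off the coefficient of the lowest remaining power of the base.
        n, coeff = divmod(n, base)
        places.append(coeff)
        if n == 0:
            return places


def from_places(places, base=10):
    """Rebuild N = n_0 + n_1*b + n_2*b*b + ... from its coefficients."""
    total = 0
    for coeff in reversed(places):
        total = total * base + coeff
    return total
```

For instance, `to_places(365)` gives `[5, 6, 3]`, and the same number in base 60 is `to_places(365, 60)`, that is `[5, 6]` (6 sixties and 5 units).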


2.2. The Power of 10 


One begins, then, by giving names to the atomic numbers 1 to 9 (in the decimal base
case), eka, dvi, ..., nava and the base itself, 10, dasa. Ideally, 11, 12, ..., can now
be designated by an application of the joining rules: one-plus-ten (ekadasa), two- 
plus-ten (dvadasa) and so on till we get to 20 at which point, a different joining rule 
should come into play to create names for two-times-ten and similarly for 30 and so 
on. Combining these two joining rules, we have a system that, in principle, names 
all numbers up to 99 and it requires exactly ten distinct names and the convention of 
the two joining rules to be remembered. When it comes to 100, we have options. The
first is to express it as the sum or product (or a combination of the two) of smaller 
numbers in very many different ways using the supposedly already acquired rules 
of arithmetic, an option that is available for numbers less than 100 as well but 
generally not used. Apart from the lack of uniqueness and the consequent absence 
of a repeatable structure in the naming rules, such an ad hoc system is bound to reach 
its limit sooner or later and new number names and more ad hoc rules will have to 
be introduced. The preferred Indian way was to give a new name to every power of 
10 so that, with minimal demand on resources, the naming method that works up 
to 99 can be indefinitely extended. It is also a fully recursive approach to naming 
and, so, to counting — one starts with a finite ordered set of names for the lowest 
nine of the ordered set of all numbers and two linguistic rules of composition (the 
joining rules); the repeated application of those rules together with progressively 
less frequent new names for the powers of 10 generates the decimal nomenclature 
of every number. 
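
The idealised naming scheme of this paragraph (atomic names, one new name per power of 10, and two joining rules) might be sketched as follows. English glosses stand in for the Sanskrit names, and the function `ideal_name`, the tables `ATOMS` and `POWERS`, and the hyphenated output format are all assumptions made for illustration; they are not the historical nomenclature.

```python
# Names for the atomic numbers 1 to 9 (index 0 is unused).
ATOMS = ["", "one", "two", "three", "four", "five", "six", "seven", "eight", "nine"]
# One new name for each power of 10 (index 0, the units place, needs none).
POWERS = ["", "ten", "hundred", "thousand"]


def ideal_name(n):
    """Name n by the 'plus' and 'times' joining rules, lowest place first."""
    if not 1 <= n < 10 ** len(POWERS):
        raise ValueError("number outside the range covered by POWERS")
    parts = []
    power = 0
    while n > 0:
        n, digit = divmod(n, 10)
        if digit:
            if power == 0:
                parts.append(ATOMS[digit])
            elif digit == 1:
                # 'one-times-ten' is simply 'ten', as in one-plus-ten.
                parts.append(POWERS[power])
            else:
                parts.append(f"{ATOMS[digit]}-times-{POWERS[power]}")
        power += 1
    return "-plus-".join(parts)
```

Extending the range only requires appending one more entry to `POWERS`; the joining rules themselves never change, which is the sense in which the scheme is recursive.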

That is the ideal. In practice the ideal was never fully realised. It is not in 
the nature of natural languages to be completely logical and Sanskrit, vedic or 
later, is no exception. In the event, there have always been deviations from the 
ideal in the way numbers were designated, most markedly in the proliferation of 
synonyms for the same number, often arrived at by decomposing it as sums and 
products in different ways.²² At the same time, astronomers and mathematicians
from Aryabhata onwards were constantly on the lookout for a rule-based, symbolic 
but verbalisable representation that would not lead to very long names for big 
numbers and so not inconvenience the versifiers unduly. The goal was achieved 


22 The book of Datta and Singh [9] has an extensive discussion of number names and number symbols
and of possible reasons for some of these deviations. They suggest in particular that the use of different
numerical expressions for the same number as well as that of bhūtasamkhyā, which is an association of
numbers with certain conventional multiplets (3 worlds, 8 directions, 9 planets (graha) and so on) has
its origin in the exigencies of Sanskrit versification.

finally with the invention and widespread use of the ingenious katapayadi notation,
especially popular in Kerala. Its basis is a systematic one-to-many map of the atomic 
numerals into the consonant syllables that takes advantage of the organisation of 
most of the Sanskrit (and Malayalam) consonants in groups of five, the latter then 
being used in a place-value format, turning numbers into mnemonic ‘words’ which 
could generally be contrived so as to give them a sensible meaning. Thus a given 
number has many verbal representations, giving the system a welcome flexibility. 
What matters more is that the inverse map from a ‘word’ to a number is well defined 
and easy to figure out; in particular, the syllabic length is exactly the number of 
‘places’ (examples: mā-tu-lah = 365, dhī-ja-ga-nnū-pu-ram = 210389, both from
YB). It is also properly recursive and comes closest to the abstract ideal, namely 
a purely symbolic representation, without need or room in it for names for any 
number other than the atomic ones (including for powers of the base), even for 
oral communication — one might say that with the katapayadi, Indian mathematics 
finally settled its dues to the Sanskrit syllabary. It is then something of a surprise to 
find it used so very sparingly in YB, written at a time and place where it was all the 
rage. Perhaps this was because YB was free from the metrical discipline of verse. 
What it prefers to use instead are the common Malayalam (and Tamil) number 
names, more systematic than Sanskrit-based ones and still in currency.²³
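
The inverse map from a katapayadi ‘word’ to a number can be sketched as below. The consonant-to-digit table is the commonly cited assignment and is not spelled out in this essay, so it should be treated as an assumption here; the ASCII keys (capital letters for retroflex consonants, `ng` and `ny` for the two nasals) and the function name `decode` are hypothetical conventions. To avoid parsing transliterated text, the caller passes, for each syllable, the consonant that carries the value (the last one of a compound consonant).

```python
# Commonly cited katapayadi values; ASCII keys are an illustrative convention.
KATAPAYADI = {
    "k": 1, "kh": 2, "g": 3, "gh": 4, "ng": 5,
    "c": 6, "ch": 7, "j": 8, "jh": 9, "ny": 0,
    "T": 1, "Th": 2, "D": 3, "Dh": 4, "N": 5,
    "t": 6, "th": 7, "d": 8, "dh": 9, "n": 0,
    "p": 1, "ph": 2, "b": 3, "bh": 4, "m": 5,
    "y": 1, "r": 2, "l": 3, "v": 4, "sh": 5, "S": 6, "s": 7, "h": 8,
}


def decode(consonants):
    """Turn the value-carrying consonants of a word into a number.

    The first syllable gives the units place, the next the tens, and so on,
    so the syllabic length equals the number of places.
    """
    return sum(KATAPAYADI[c] * 10 ** place for place, c in enumerate(consonants))
```

With these conventions the two examples quoted from YB decode as `decode(["m", "t", "l"])` and `decode(["dh", "j", "g", "n", "p", "r"])`.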

It is true that many of the points made above are not directly relevant to a strictly
symbolic written representation such as the Babylonian sexagesimal system. In 
order to read their numerical tablets, we do not need to know what names the 
Babylonians gave to their atomic numerals 1 to 59, leave alone what joining rules 
they had. We do not even need to know whether they had names for their numbers at 
all. It is nevertheless an amusing exercise to compare bases: what relative advantage 
or drawback can be discerned in the choice of a particular base, 10 or 60? Whether 
written or spoken, there is no escaping the requirement that each atomic numeral 
has to have an individual name or symbol which will have to be memorised. The 
Babylonian strategy of employing a base-within-a-base to simplify writing —clusters 
of the unit symbol up to 9 and then a new symbol for 10, similarly clustered for 
multiples of 10 — we may take as having been devised mainly to overcome the 
barrier of 10. (Did the names they must have had use a similar strategy?) Did the 
Indian idea of using 10 not merely as a notational abbreviation, but as the base in 
the place-value representation, emerge out of this and out of the realisation that it 
is wasteful to have two different fixed numbers playing two distinct roles where 
one would do? In our current state of historical knowledge, the question has no 


23 There is one singular feature of these names worth mentioning: the number 9 is designated as
‘1 less than 10’, a usage which spills into the names of 90 and 900 (though not, as in Sanskrit-based 
languages, into 19, 29, etc.). Taken together with the identity of the word for counting and the name of 
the number 8 in Dravidian languages, it has given rise to speculation that the old Dravidian civilisation 
counted in base 8; see Fairservis [23], quoting Kamil Zvelebil, who make the case that this may actually 
be a legacy from the Indus valley language.

satisfactory answer. It is also probably moot; the very emphasis on naming points to
a more likely source of inspiration, the proximate one of the phonetic organisation 
in the vedic period of the Sanskrit syllabary. It may in fact very well be that both the 
number names and the syllabary have a parallel early vedic origin and evolution. 

In the Indian oral system, one had to have names also for the powers of 10; how 
many were needed depended on how high one wanted to count. It is completely 
elementary to see that, to count up to parārdham ($10^{17}$), one needs 27 (10 for the
atomic numbers and 17 for the powers of 10) distinct decimal names, to be combined
by the joining rules, and 69 (60 + 9, $60^{9} < 10^{17} < 60^{10}$) sexagesimal names; to
count up to $10^{7}$, the corresponding numbers are 17 and 64 respectively. The decimal
system wins simply because of its far smaller number of atomic numbers. This may 
appear a frivolous reason to choose 10 over 60, but in a dominantly oral culture, 
it is not to be dismissed as irrelevant. And when it comes to doing arithmetic, the 
advantages are more concrete. For instance, once the place-value multiplication 
procedure is put in place, a major achievement in itself, one has only to learn ‘by 
heart’ the multiplication table up to 9 x 9, something we all did in primary school, 
and then just go by the rules. The basic sexagesimal 59 x 59 multiplication table 
cannot ordinarily be memorised by mortals. As is well known, the Babylonians 
overcame this obstacle by using formulae like 


$$m \times n = \frac{(m+n)^2 - (m-n)^2}{4}$$


and preparing written master-tables of squares. 
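
As a minimal sketch of how a master-table of squares replaces a 59 × 59 multiplication table, the formula above can be applied with a single lookup table. The names `squares_table`, `SQUARES` and `multiply_by_squares` are hypothetical, and nothing here is meant to reproduce the layout of an actual Babylonian tablet.

```python
def squares_table(limit):
    """Prepare the 'master-table' of squares 0*0, 1*1, ..., limit*limit."""
    return {k: k * k for k in range(limit + 1)}


# m + n can be as large as 59 + 59 = 118 for sexagesimal atomic numerals.
SQUARES = squares_table(118)


def multiply_by_squares(m, n, squares=SQUARES):
    """Compute m x n as ((m + n)^2 - (m - n)^2) / 4 using only table lookups."""
    big = squares[m + n]
    small = squares[abs(m - n)]
    # The difference of the two squares equals 4*m*n, so the division is exact.
    return (big - small) // 4
```

The table has 119 entries, against the 1770 distinct products of a full 59 × 59 table, which is the economy the formula buys.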


2.3. Arithmetic 


The fundamental arithmetical operation in YB is the addition of 1 together with its
inverse, the subtraction of 1. All the eight operations it explicitly describes, general 
addition and subtraction, multiplication and division, taking squares and cubes, and 
taking square roots and cube roots, are built out of these two basic ones by, it is 
needless to say, their repeated application: “Addition is of use in multiplication, 
multiplication in squaring, squaring in cubing. Similarly, subtraction is of use in 
division, division in [taking] square roots, square roots in [taking] cube roots. Thus, 
the preceding [operations] will be of use in the succeeding” ([Sarma] 1.3). (The eight 
operations are first classified as those which increase and those which diminish). 
Addition of two numbers m and n is, as expected, explained as the n-fold repeated
addition of 1 to m (in more modern words, repeated applications of the successor 
function) and similarly for subtraction. There is no explicit reference to negative 
numbers in this (first) chapter; what to do with n − m for n < m does not get a
mention. 






Multiplication of m and n is then defined, also as expected, as the addition of m
to itself n times. ("Then multiplication — it is just addition if we reflect [on it]"). The 
only slightly surprising point arises in relating division to subtraction, accomplished 
by falling back on geometry in the time-honoured way. Imagine the dividend n to 
be the area of a rectangle one of whose sides is the divisor m. The quotient is 
then the maximum number of rows or columns, i.e., m × 1 rectangles, that can be
removed from the whole figure: n/m := l such that n − l × (m × 1) = 0. Again,
it is tacitly understood that n has to be a multiple of m; fractions are not mentioned
here. 
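
The chain of definitions in this section, everything built from adding or subtracting 1, can be written out as a short sketch. The function names are hypothetical and the code is a modern paraphrase under the same tacit assumptions as the text (no negative results, and the dividend a multiple of the divisor); it is not a transcription of YB's procedures.

```python
def successor(m):
    return m + 1


def predecessor(m):
    return m - 1


def add(m, n):
    """Addition: apply the successor n times to m."""
    for _ in range(n):
        m = successor(m)
    return m


def subtract(m, n):
    """Subtraction: apply the predecessor n times to m (n must not exceed m)."""
    if n > m:
        raise ValueError("negative results are not treated here")
    for _ in range(n):
        m = predecessor(m)
    return m


def multiply(m, n):
    """Multiplication: add m to a running total n times."""
    total = 0
    for _ in range(n):
        total = add(total, m)
    return total


def divide(n, m):
    """Division: count the m x 1 rows that can be removed from an area n."""
    if m <= 0:
        raise ValueError("divisor must be positive")
    rows = 0
    while n >= m:
        n = subtract(n, m)
        rows = successor(rows)
    if n != 0:
        raise ValueError("dividend is not a multiple of the divisor")
    return rows
```

Each operation calls only the one defined before it, mirroring "the preceding [operations] will be of use in the succeeding".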

All of this is totally in conformity with the Indian approach to the logic of 
arithmetic from much earlier times. In particular, there is no sign of any sympa- 
thy for abstraction as might have been expected 400 years after Bhaskara wrote 
Bijaganita. Even by implication, there is no hint that subtraction of and division 
by arbitrary positive integers will automatically force the extension of the natural
numbers to include negative integers and fractions. Fractions in their own right 
make their entrance soon enough, in chapter 3, where, for example, division by 
n/m is carefully shown to be the same as multiplication by m/n. But we have to go 
deep into the calculus of the series for π to see the first algebraic statement about
negatives, a citation of Brahmagupta's well known aphorism and its generalisation
to indeterminate numbers ("quantities (rasi) whose numerical values (samkhya) are
unknown"). The terms in the π series are given by a recursive rule and they alternate
in sign, so this belated recall of well-established principles and their easy adaptation
to variables was inescapable. It is certain that there was no fear of negatives, there
had not been any for a long time [11], but they were still, to facilitate computation,
handled with care (“Here, if an indeterminate number (rasi) is of negative nature, it
must be distinguished by some mark”; the context is computation in indeterminates
with the aid of tokens, [Sarma] 6.8, p. 75).

It is equally clear that there was, if not fear, a degree of wariness regarding 
irrational numbers, even after Madhavan had broken through the rationality barrier 
with his π series, the significance of which was well understood by Nilakanthan.
The suspicion of irrationals also goes back a long way (see [16]). In YB itself, 
the procedure explicitly described for extracting square and cube roots of numbers 
presupposes that the answers will be whole numbers. Later, in the chapter on 
fractions, it is explained that √(n/m) is √n/√m, but n and m are still squares.
Throughout the history of Indian mathematics, there does not appear to have been 
any attempt made to understand the essential difference between fractions and 
numbers such as √2 (or π for that matter) till we get to Nilakanthan, even though during
and after Aryabhata's time the irrational π was often approximated by the irrational
√10 rather than by more accurate and easier-to-handle rational approximations.
Nor, apparently, was the possible irrationality of √2 ever broached despite accurate
algorithms for finding rational approximations to it being known from at least the
time of the Sulbasūtra.

The fear of the irrational may provide an indirect explanation for another enduring
mystery: the absence of a decimal representation for fractions in Indian
mathematical work. It would have been a natural and undemanding thing to have 
done, as demonstrated convincingly by the Babylonians. Can it be that the avoidance 
of decimal fractions has something to do with the fact that even simple fractions 
can have decimal expressions that are actually, not just potentially as in the case of 
whole numbers, nonterminating? 

There is an irony here. When it came to geometry, there was no way to keep 
away from square roots since one of the two pillars of Indian geometry, the theorem 
of the diagonal (Pythagoras’ theorem; the properties of similar triangles form the 
other pillar) cannot possibly dispense with square roots. In the early part of chapter 
6, YB describes the (infinitely iterative) procedure of getting at the circumference 
from the diameter by starting with a circumscribing square and repeatedly doubling 
the number of sides through circumscribing regular polygons. (It is a good guess 
that Aryabhata found his approximation to π in this way). The procedure is of
course riddled with square roots. It must have rankled. Before starting on the first
steps that will finally lead to the π series, YB announces: "After this, [I] describe
a way of producing the circumference for any desired diameter without having to 
take square roots”. With such undramatic words were his pupils initiated into the 
new-born discipline of calculus by Jyesthadevan. 
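
To see why the polygon-doubling procedure is "riddled with square roots", here is a rough numerical sketch. It uses the modern half-angle identity tan(θ/2) = tan θ / (1 + √(1 + tan² θ)) in place of YB's own geometric construction, which is not reproduced in this section; the function name `circumference_by_doubling` and its parameters are hypothetical.

```python
import math


def circumference_by_doubling(diameter, doublings):
    """Perimeter of the circumscribing regular polygon after repeated doubling.

    Starts from the circumscribing square and doubles the number of sides
    `doublings` times; each doubling costs one square root.
    """
    sides = 4
    t = 1.0  # half-side divided by radius for the square, i.e. tan(pi/4)
    for _ in range(doublings):
        # Half-angle step: one square root per doubling of the sides.
        t = t / (1.0 + math.sqrt(1.0 + t * t))
        sides *= 2
    # Each side has length diameter * t, and there are `sides` of them.
    return sides * t * diameter
```

With no doublings the function returns the perimeter of the square, four times the diameter; each further doubling brings the value down towards π times the diameter.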


2.4. Origins? 


Let us conclude this bird’s-eye view of the grammar of numbers with a brief return 
to the question of origins and possible influences. The natural, in fact the only, math- 
ematical candidate that might have served as a model for the decimal place-value 
‘language’ — it was a language, orally expressed, with its own grammatical rules — is 
the Babylonian sexagesimal system.²⁴ The mathematical apogee of Mesopotamia,
roughly 2000-1600 BCE, was contemporaneous not with the vedic but with the 
earlier Indus valley or Harappan civilisation in its late phase. Some vestiges of west 
Asian civilisations have been found in Indus valley sites, but they are so sparse 
and inconclusive as to be useless for our purpose. The reverse flow is somewhat 
better authenticated, in the form of a number of seals, sealings, etc., unearthed in an 


24 The possibility of a Chinese influence is definitely discounted by the profusion of number names
reflecting an evolved decimal structure in the Rgveda, including in its earliest mandalas, a fact already
alluded to several times. (A forthcoming article in collaboration with Bhagyashree Bavare will look at
the evidence in detail). The earliest Indian references to China are related to commercial activities and
do not predate the 3rd century BCE (in the Arthasastra). Long before that time, not only a perfected
decimal counting system but also fairly advanced arithmetical procedures using it (the Sulbasūtras of
Baudhayana and Apastamba) were well in place. For a quick tour d’horizon of Sino-Indian intellectual
engagement, which flourished most vigorously in the first millennium CE, see Amartya Sen’s essay
“China and India” in [24].

area spreading from the Arabian coast through the mouth of the Tigris-Euphrates
into modern Syria. As for the Indus culture within its native domains, the ruins 
and the excavated artefacts are witness to a high degree of geometrical and met- 
rical sophistication, extending from the construction of their cities and the varied 
symmetric patterns on a fairly extensive sample of pottery, seals etc. [22], to the
very accurate regularity of the weights. The latter in particular persuade us to con- 
clude that it was a predominantly binary culture (or one based on a low power 
of 2), the weights doubling in steps from 1 to 32 units, with weak indications of
a decimal progression as they got heavier. Then there are the signs showing the 
supposed numerals 1 to 7 by vertical strokes, giving rise to the speculation al- 
ready cited (see footnote 23) that counting was octal.²⁵ All in all, there is little that
indicates a special role for the number 60 in the greater Harappan culture in its 
evolved form over its entire duration (of at least a millennium) and extent (from 
modern Baluchistan to the Indus river basin, to the river Yamuna to the east and 
across Saurashtra). Nor is there any such evidence for the vedic culture²⁶ that succeeded
it in northwest India and then spread quickly into most of the Ganga river
basin. 

But it is fair to say that Harappan research is still in an early consolidation phase. 
Despite the vigour that animates the field, going beyond the material remains to a 
deeper insight into the culture has proved to be an elusive quest; we are nowhere 
near getting inside the minds of the Indus people and we will not get much closer 
as long as their writing remains unread. One can always hope that there will be a 
breakthrough on the decipherment front one of these days and that the writing will 
then turn out to contain information about the west Asian civilisations with which 
they had close commercial ties. While waiting for that to happen, what is one to 
do? The course that recommends itself is to concentrate on whatever we can learn 
from what we do have. There is no question that our understanding of the vedic 
intellectual legacy is much richer and deeper compared to the sketchy and uncertain 
picture we have, or are likely to have in the near future, of the Indus culture in its 
less material aspects. That rich vein is far from exhausted. From what we know 
about vedic Sanskrit and Panini's grammar and from what we can learn about the 
Indian approach to numbers from even as late a text as YB, it does not seem to 
be a far-fetched idea to look for links between the two.²⁷ Indeed, already at the
level of phonology, there are striking similarities. Sanskrit organises its syllables 


25 I thank Nisha Yadav for a valuable update on the current state of knowledge on the Indus civilisation.
It should be added that the idea of an octal base has also had some doubters. 

26 There are sporadic mentions of the number 60, as there are of other multiples of 10, in the
Rgveda but no indication that the powers of 60 held any special significance (work in collaboration with 
Bhagyashree Bavare) as would be expected if 60 was used as a base. 

27 The case for doing so goes back to the vedic studies of Louis Renou and more recently has been 
most persuasively made by Frits Staal over a period of time; for an account of Staal’s ideas, see [12]. 
The whole of the present section 2 can be thought of as a limited excursion in that direction.

in a well-ordered sequence, the vowels first and then the pure consonants in five
groups of five each (plus a few outliers which don't fit the pattern), the order of the
groups depending on the part of the vocalising organs utilised in articulating them,
progressively from the back of the palate to the lips; see for instance [25]. (There
is also a well-defined order within each group). The genesis of this organisation 
goes back to vedic times, its final systematisation coinciding roughly with the era 
of Panini [25]. It is easy to see in the pure syllables an analogue of the atomic 
numerals; there are rules for combining them to produce all legitimate sounds of the 
language which are in turn used to formulate rules to combine clusters of sounds or 
‘words’. In the context of phonology, the vowels and the simple consonants are ‘as 
if’ prakrti (see footnote 28 below), all other sounds being constructed out of them.
(The ordering of the syllables and the combination rules are of importance in the
katapayadi mapping of numbers into verbal expressions; for instance, the vowel
value of a syllable has no numerical significance, just like all but the last sound of a 
compound consonant). 

Perhaps, rather than think of Sanskrit grammar as a possible imperfect model for 
the Indian number system, we should open our minds to the possibility that both are 
parallel manifestations of a particular intellectual mindset, specific to the time and 
the place. Perhaps there was a mathematical Panini, perhaps he even wrote a book 
entitled Dasadhyayi, alas lost forever. 


3. Varieties of Recursion — the Arctangent Series 
3.1. Generalities, the Notion of Samskaram 


The “doctrine of decimal numbers” which gave confidence to Newton in the han- 
dling of infinite series in his early work in calculus had been a constant presence 
in Indian mathematics from at least the vedic era. When the Kerala mathematicians 
found their own route to calculus, two and a half centuries before Newton, they 
were the inheritors of a decimal culture that was seeded two and a half millennia 
earlier and had grown to healthy maturity in the following thousand years or so. 
Though it is not in the nature of Indian mathematical writing to spend much time 
on motivation and history, the influence of decimal numerical thinking is so evident 
in all of it that it is commonly characterised as fundamentally computational. So it 
is not only in the Kerala school’s fearlessness of infinite series (and of the idea of 
infinity itself) but in the totality of their mathematics that we come upon the per- 
vasive undercurrent of recursive construction that is at the core of the place-value 
representation of numbers. 

Recursive construction generally figures in Kerala writing on calculus in the 
guise of a samskāram (refining; occasionally, other synonymous terms are also
employed). The result of a process of samskāram is something that is samskrtam,
that which is or has been refined. This is also the name of the language in which
the writing itself was done, Sanskrit, as distinct from prakrtam or Prakrit, that
which is natural.²⁸ There are variations in the details of how mathematical refining
is carried out in different situations, but the general idea in its simplest variant 
can be summarised as follows. In a given problem (evaluating a geometrically 
defined quantity, solving an algebraic or differential equation, etc.), i) use insight 
and ingenuity to make a first guess of the answer, generally incorrect; ii) use the 
formulation of the problem itself (i.e., in ignorance of the correct answer) to obtain 
an exact expression for the difference between the exact answer and the first guess 
that displays the (unknown) exact answer in a practically usable form; and iii) 
repeat the process by substituting for the exact answer from the expression obtained 
in step ii), which will result in an updated guess displaying, again, the unknown 
answer, and so on. Stated like this the procedure may appear both pointless and 
vague. Slightly less vaguely, suppose the first guess for the unknown answer $x$ is $x_0$
and the difference $x - x_0$ is a known and easily computable function $f$ of $x$. Then
$x = x_0 + f(x) = x_0 + f(x_0 + f(x)) = x_0 + f(x_0 + f(x_0 + f(x))) = \cdots$ (in
practice it may be necessary to approximate $f$ in some clever way). The examples
from YB that we are going to be dealing with will illustrate the variations that are 
possible within this general framework. 
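
The three steps of the refining procedure can be sketched as a short iteration. The example chosen here, finding √2 from the first guess 1, is an illustration supplied for this sketch and is not one of the YB examples discussed below; the names `refine` and `sqrt2_correction` are hypothetical. The correction comes from rewriting x² = 2 as (x − 1)(x + 1) = 1, so that x − 1 = 1/(1 + x) displays the unknown answer on the right-hand side, as in step ii).

```python
def refine(first_guess, correction, rounds):
    """Iterate x -> first_guess + correction(x), starting from the first guess.

    Each round substitutes the current estimate for the unknown answer inside
    the correction, as in x = x0 + f(x0 + f(x0 + ...)).
    """
    x = first_guess
    for _ in range(rounds):
        x = first_guess + correction(x)
    return x


def sqrt2_correction(x):
    """Exact difference x - 1 for x = sqrt(2), written in terms of x itself."""
    return 1.0 / (1.0 + x)
```

One round gives 1.5 and two rounds give 1.4; stopping early leaves a gross or fine approximation, in the sense used in the following paragraph.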

But, before that, we may already take note of some of the qualitative features that 
emerge from these examples and illustrate how they vary from instance to instance. 
First, the procedure may be difficult to carry through beyond one or two iterations 
and so may have to be terminated, resulting in an approximation, often characterised 
as gross (sthūlam) or fine (sūkṣmam), to the correct answer. In one case where this 
happens (in the problem of estimating the truncation error in the π series), YB goes 
back and plays around with the initial guess so as to get a better approximation. The 
second possibility is that the procedure leads to the exact answer either in a finite 
number of steps (finite recursion, the most sophisticated example being kuttakaram 
or kuttaka, ‘pulverisation’)²⁹ or by leading back on itself (cyclic recursion, as in 


28 YB is of course an exception, for reasons unknown. But it is so clearly in the Sanskrit sastra 
mainstream that the actual language of writing can sometimes seem almost an irrelevance. Early in the 
first chapter, there occurs the elliptic sentence, “The numbers one to ten [written as words in the text] are 
like (as if they were) prakrti [that which occurs naturally, the original]", followed by an explanation of 
how at every stage of extension from what I have called atomic numbers to successively higher powers 
of 10, the earlier stage of construction becomes the prakrti for the next one. Tampuran and Ayyar [5] 
provide an enlightening footnote on the general connotation of the term prakrti in the sastra literature 
and the reason why YB has to resort to the equivocation of “as if". Malayalam grammar uses the same 
term to denote an ‘original’ word from which derivatives are constructed for related linguistic units 
such as case endings for nouns. It is useful to remember that the two hundred years during which the 
Kerala school flourished was also the time that saw Malayalam, having absorbed the massive verbal and 
structural invasion of Sanskrit, beginning to settle down to its modern identity. 
29 The method of kuttaka was devised in its most elementary form by Aryabhata to solve equations 
of the form $by - ax = \pm c$ in positive integers for positive integer coefficients $a$, $b$, $c$. The problem 
arises in matching (for example) the solar and lunar periods in astronomical time-keeping and has a 
the putative exact solution of Aryabhata’s difference equation for the sine). The 
third and final possibility is the most interesting: that the recursion does not end but 
has a sufficiently regular structure for it to be carried through ad infinitum (infinite 
recursion). The resulting series are the most celebrated fruits of the Kerala calculus, 
but the main emphasis in the present article will not be on the series per se, nor on 
the deeper methodological advance (the infinitesimal method) that led to them, but 
only on the recursive element. 

To illustrate some of the issues specific to infinite recursion, it is enough to refer 
to the first two examples from YB, the π series 

$$\frac{\pi}{4} = 1 - \frac{1}{3} + \frac{1}{5} - \cdots$$

and its generalisation to the arctangent series for an arbitrary angle θ lying between 
0 and π/4, 

$$\theta = \tan\theta - \frac{\tan^3\theta}{3} + \frac{\tan^5\theta}{5} - \cdots$$


Infinite series such as these, arrived at by a formal recursive procedure, run the 
risk of not converging to any finite answer. Whether by insight or otherwise, the 
Kerala mathematicians invariably avoid this ever-present pitfall — it is trivially easy 
to produce recursively constructed series which do not converge, as the following 
subsection will illustrate. The question of convergence is a particularly sensitive one 
for the arctangent series since the variable tan θ is not bounded. It is quite obvious 
from the paragraph from chapter 6 of YB explaining how to go from the π series to 
the arctangent series that its author was aware of this and knew what to do about 
it. The first sentence, “Here [I] explain how to find the arc [Rθ] corresponding to 
the bhujajya [R sin θ] or kotijya [R cos θ], whichever is smaller” ([Sarma] 6.6), 
makes it clear that what is sought is not a formal expansion of θ in powers of 
tan θ for the entire range of values of θ (it is enough to consider angles between 0 
and π/2 which is what YB does) but rather a convergent expansion in the variable 
appropriate to a given subdomain of values of θ, i.e., that the tan θ expansion is 
the correct convergent one for θ in the first octant, sin θ < cos θ. The last sentence 
of the paragraph confirms that when cos θ < sin θ, the appropriate variable is in 
fact cot θ. (The full paragraph is part of section 6.6, to be read with Fig. 6.6 of the 
explanatory notes, of [Sarma]). If this is not sufficiently convincing evidence of the 
concern that the series should make sense, we have the whole of the very elaborate 


whole chapter (5) in YB. The solution is a recursive one, involving the use of the Euclidean algorithm 
in a numerical (integral) form. It is very well explained in several places, e.g., [9], the epilogue (B) of 
[Sarma] and most thoroughly, especially in its connection with continued fractions, in the appendix (in 
English) to [TA]. For this reason, and since it has no direct linkage to any other of the recursive tools 
employed by YB, I shall have nothing further to say on the subject here. 
discussion at the end of chapter 6 (sections 6.8, 6.9 and 6.10 of [Sarma]) of methods 
of accelerating the rate of convergence and of estimating truncation errors which 
would make no sense unless one knew that the series was convergent. 

There is also in YB an instance to illustrate the other side of this question as it were: 
Madhavan's interpolation formulae for the sine and the cosine. They are arrived at 
by a novel variant of samskaram dealing with two quantities (sin(θ + δ) − sin θ 
and similarly for cos) simultaneously and so provide a very interesting example of 
recursive techniques. But, unlike in the case of the power series around 0 for the 
same functions, YB treats the application to interpolation briefly, not going beyond 
the second order correction and dismissing the subject with a terse remark on how 
to improve the accuracy of the approximation by two more orders — it does not 
refer to the possibility of repeating the process indefinitely. And that is a good thing 
because, if it did, the result would be an infinite series for (sin / cos)(θ + δ) in powers 
of δ, a would-be Taylor series which is not the Taylor series because the coefficients 
are wrong. 

The other issue, related to this but more philosophical than technical, is one 
which seems to have caused some confusion among scholars. It is this: assuming 
that infinite recursion gives a convergent series in a given problem, did the Kerala 
mathematicians themselves think of such results as a series of successive approx- 
imations only, or as results which are in principle exact when ‘all’ the terms are 
added (‘in the limit’ as we would say), but which for practical purposes will have to 
be terminated at some point? Several historians have subscribed to the first view in 
recent times, sometimes explicitly. Partly, the confusion is due to the excessive em- 
phasis put on these series in the early writing in English (and, often, even today) on 
the Kerala work, to the detriment of the driving philosophy behind it, that of turning 
Aryabhata's approximation methods into a technique for getting exact answers by 
going infinitesimal.³⁰ But a reading of YB leaves no room for doubt that this is a 
mistaken view. Chapters 6 and 7 are liberally strewn with remarks which either say 
specifically that the results expressed by the appropriate series are accurate or exact 
when the recursion is carried out fully, or which are meaningful only if the author 
knew that to be the case: a few instances will be found in sections 6.3.1, 6.3.2, 6.3.3 
and 6.4.1 of [Sarma]. (Chapter 7 is less explicit, often harking back to chapter 6 


30 Given the incredulity with which Western mathematical-historical circles slowly became aware of 
the discovery, in a distant corner of a British colony, of the texts containing these series, this is to an extent 
understandable. But there was another factor contributing to this misreading, which is the unavailability 
until now of YB in a widely accessible language. The Sanskrit-reading historians of Indian mathematics 
have been obliged to fall back upon two texts which can fairly be called (in the best vedic tradition) 
Yuktibhasapratinidhi, YB-substitutes, namely Yuktidipika and Laghuvivrti, Sankaran’s vyakhyas of TS. 
Both are written in a rigid Sanskrit verse format that discourages any indulgence in revealing asides such 
as we meet in the more loosely written Malayalam prose of YB. Consequently, while they are invaluable 
source books on the intricate geometric constructions and computations that go into the work on the 
series, they open only a narrow window on the conceptual issues that the development of calculus had 
to face. 
where these issues have already been confronted and settled). The conclusion is 
reinforced by YB’s silence in an apparently similar context, the interpolation formula 
referred to above; it is as though the power series and the numerical series for π 
were recognised to be exact in the limit but the interpolation series would not be 
and was merely a means of approximation.³¹ In fact, evidence for the awareness of 
the distinction between an exact state of affairs and approximations to it forced by 
the demands of practical utility is already present in the very beginning of YB, in 
connection with the infinitude of numbers and the place-value representation of any 
arbitrary (finite) number. The sentence I have quoted earlier on the impossibility of 
knowing and naming numbers is followed immediately by: “Therefore, for the purpose 
of making use of them (or working with them, the Sanskrit/Malayalam word 
used is vyavaharam; “them” means the infinitude of integers), think as follows. The 
first place is for the numbers from one to nine. Then the place of all these multiplied 
by ten [is] the second. Let that be to the left. The place of one, the place of ten, and so 
on, are their names. That is the nature of numbers" ([Sarma] 1.2). This is as good a 
characterisation of the distinction between the (recursively defined) axiomatic and 
the (recursively constructed) arithmetical view of natural numbers as any we can 
think of. 


3.2. The π Series and the Arctangent Series as Rectification 


There are now available in English at least two detailed and faithful annotations 
of the yukti as described in YB of these two series, Sarasvati Amma's book [6] 
which has been around for a while now and the very recent explanatory notes [RSS] 
appended to Sarma's translation. With the assurance that the reader can fall back 
on these references for the details not discussed here, we can focus on just the 
recursive elements in the contents of chapter 6 of YB in which these series are 
exhaustively treated. They are abundantly supplied with diagrams, generally based 
on the Tampuran-Ayyar critical edition, and that is also a help: on the occasions 
on which we need to look at the specifics of the geometry, I can simply refer the 
reader to the appropriate figures from [RSS] rather than reproduce them once again. 
Also, since the order in which YB obtains these results — first the numerical series 
for z which is then generalised by a use of the properties of similar triangles — 
has no special interest from the recursive viewpoint, it is unnecessary (and often 
inconvenient) to stick to that order. 

These series are answers to what in Europe came to be called rectification 
problems: one is required to find the length of a curve given either by an equation in 


31 Because of the historical interest of the misunderstandings about this aspect, it will be worth returning 
to it after a fuller discussion of the interpolation formulae (section 4.3). Sankaran’s commentaries on 
TS give much greater space to these formulae than does YB and that may have contributed to magnifying 
their impact. 
the Cartesian way or as the trajectory of a point under a prescribed motion. (In the 
present case, the curve is the circle of unit radius). The general approach used in YB 
to get to the answer is a precursor of what came to be called Riemann integration: 
divide the relevant interval of the x-axis (the independent variable, in our case tan θ) 
into n equal³² segments, estimate the length of the curve corresponding to one 
segment of x by linearising it locally (here, by approximating the corresponding 
segment of arc by half the chord of twice the arc, i.e., by replacing the angle $\delta_i$ 
subtended by the ith segment at the centre by $\sin\delta_i$), add up the n contributions 
and, finally, let n increase without bound (‘in the limit $n \to \infty$’). The limit makes 
the linear approximation exact, $\sin\delta_i \to \delta_i$ as $\delta_i \to 0$ and, moreover, allows the 
systematic neglect of terms of order 1/n in the sum as n is taken to infinity. The 
linearisation is of course the heart of the matter, the critical conceptual step beyond 
Aryabhata's finitistic method. There are only two ways in which the procedure differs 
from what we teach today in a course in calculus, one minor and the other not so. 
The minor point is that the modern positive x-axis is, in the Kerala (and Indian) 
convention, the direction east which, in a diagram on sand spread on the floor, 
points away (and in the very rare diagrams on palm leaf, up), and the first quadrant 
is the E-S quadrant, so reflecting a Kerala circle through the N-S axis produces 
our conventional graph of the equation of the circle $x^2 + y^2 = 1$ (see Figs. 6.3 and 
6.5 of [RSS]). The more interesting difference is that the work is carried forward 
as far as it will go before the $n \to \infty$ limit is taken, resulting in the uncovering of 
some striking, essentially arithmetical/algebraic, identities. The simplest of them 
is a finite counterpart of the formula for integration by parts (a simple special 
case of the Abel resummation formula, see section 3.4 below), a key step in the 
inductive evaluation of the integrals of positive integral powers. I draw these obvious 
parallels in order to remove any residual misgivings about the calculus credentials 
of YB and to highlight the fact that the postponement of the limit as late as possible 
is the essential difference between the Indian and the European approaches to 
calculus. 

Having divided the unit tangent into n equal segments by points $A_i$ (so the length 
$A_{i-1}A_i = 1/n$, independent of i), the geometry introduces the diagonals (karnnam) 
connecting the centre of the circle to each $A_i$, of length $d_i$ (Fig. 6.5 of [RSS]). The 
main geometrical result is that the sum of all the half-chords, $S_n := \sum_i \sin\delta_i$, is 
given by (the half-chords are denoted by $b_i$ in [RSS]): 

$$S_n = \frac{1}{n}\sum_{i=1}^{n}\frac{1}{d_{i-1}\,d_i}.$$


32 The equality is not essential for the procedure to go through. It is just the easiest way to avoid the 
direct introduction of the infinitesimal limit in which each segment is taken individually to zero. With 
the division by n, the only limit that needs to be taken is $n \to \infty$. 
This is an exact result valid for any n. In the first instance of the use of the $n \to \infty$ 
limit, YB replaces $1/(d_{i-1}d_i)$ by $(1/2)(1/d_{i-1}^2 + 1/d_i^2)$. This is one of the very few steps 
for which YB provides no justification, but it is easy to see that the two quantities 
are equal to the first order in $(d_{i-1} - d_i)/d_{i-1}$. Thus $S_n$ is approximated, first by 

$$S'_n = \frac{1}{2n}\sum_{i=1}^{n}\left(\frac{1}{d_{i-1}^2} + \frac{1}{d_i^2}\right)$$

with $d_0^2 = 1$ and $d_n^2 = 2$ (normalised radius), and then by 

$$S''_n = \frac{1}{n}\sum_{i=1}^{n}\frac{1}{d_i^2}.$$

$S'_n$ differs from $S''_n$ by 

$$\frac{1}{2n}\left(\frac{1}{d_0^2} - \frac{1}{d_n^2}\right) = \frac{1}{4n},$$

which is ignored as $n \to \infty$. This last step is given a justification in the words, 
“As the segment of the side [the tangent] becomes small[er], the one-fourth part 
[= 1/4n] can be discarded” ([Sarma] 6.3.2), leaving no room for doubt about YB's 
appreciation of the power of the limiting operation: $S''_n$ will tend to an eighth part of 
the circumference (= π/4) and its expression in terms of the diagonals will tend to 
the π series. 
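
The behaviour of the three sums can be checked numerically. The following Python sketch is a modern illustration only, under the assumption (taken from the normalisation above) that the squared diagonals are $d_i^2 = 1 + (i/n)^2$ for a unit radius and a unit tangent divided into n equal parts; the function name `half_chord_sums` is hypothetical.

```python
import math
from fractions import Fraction


def half_chord_sums(n):
    """Return (S_n, S'_n, S''_n) for a unit tangent divided into n equal parts.

    S_n is computed in floating point (it needs square roots);
    S'_n and S''_n are computed exactly as fractions.
    """
    # Squared diagonals d_i^2 = 1 + (i/n)^2, for i = 0..n (assumed normalisation).
    d2 = [1 + Fraction(i, n) ** 2 for i in range(n + 1)]
    # Sum of half-chords: (1/n) * sum of 1/(d_{i-1} d_i).
    s = sum(1 / math.sqrt(d2[i - 1] * d2[i]) for i in range(1, n + 1)) / n
    # First replacement: average of the reciprocal squares of adjacent diagonals.
    s1 = sum(1 / d2[i - 1] + 1 / d2[i] for i in range(1, n + 1)) / (2 * n)
    # Second replacement: reciprocal squares of the diagonals alone.
    s2 = sum(1 / d2[i] for i in range(1, n + 1)) / n
    return s, s1, s2


s, s1, s2 = half_chord_sums(50)
print(float(s1 - s2), 1 / (4 * 50))   # the discarded 'one-fourth part'
print(s, float(s2), math.pi / 4)      # all three approach pi/4 as n grows
```

The difference between the two approximating sums is exactly 1/4n for every n, which is why it can be dropped in the limit.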


3.3. The Elementary Recursive Identity 


It is in expanding each term in $S''_n$ in an infinite series that the recursive identity is 
first met with. It begins with the trivial equation 


where a and b are, in the context, geometrically defined positive real numbers, 
namely $a = d_i^2$ whose reciprocal is the ‘unknown’ quantity to be computed in terms 
of the ‘known’ quantity $b = (\text{radius})^2\ (= 1)$; thus a = b is the first guess. The exact 
correction is 

$$\frac{1}{a} - \frac{1}{b} = -\frac{a-b}{b}\,\frac{1}{a},$$

displaying the unknown a in a form suitable for iteration. The first samskaram, 
obtained by substituting the first guess in the correction term, gives the approximate 
value $1/b - (a-b)/b^2$, while substituting the exact correction itself gives 

$$\frac{1}{a} = \frac{1}{b} - \frac{a-b}{b}\left(\frac{1}{b} - \frac{a-b}{b}\,\frac{1}{a}\right).$$

The recursive identity is thus 

$$\frac{1}{a} = \frac{1}{b} - \frac{a-b}{b^2} + \frac{(a-b)^2}{b^3} - \cdots + (-1)^k\,\frac{(a-b)^k}{b^k}\,\frac{1}{a},$$

valid exactly for all k = 1, 2, ···. The kth samskaram will consist simply of putting 
a = b in the last factor of the last term. 

It is obvious that the utility of the procedure depends on what kind of a handle 
we can get on the quantity a − b. In the problem of the π series, $a = d_i^2$ is the square 
of the hypotenuse and b = 1 the square of a side of a right triangle. So a − b is 
the square of the third side, $a - b = i^2/n^2$; the kth recursive identity for $1/d_i^2$ then 
takes the form 

$$\frac{1}{d_i^2} = 1 - \frac{i^2}{n^2} + \frac{i^4}{n^4} - \cdots + (-1)^k\,\frac{i^{2k}}{n^{2k}}\,\frac{1}{d_i^2}.$$

The name given to the terms (all but the last, one imagines; the terminology in the 
text is not completely unambiguous) on the right of the identity is śodhyaphalam. 
The process of determining the correction terms is itself not called samskaram 
presumably because that word is reserved for an approximation; and there is no 
intention here to stop the iteration after a certain number of steps and call it a 
day.³³ YB has other ideas, as comes across emphatically in the closing lines of the 


33 The literal meaning of śodhyaphalam, ‘the result of refining (or purification)’, is not very different 
from samskaram. One of the minor vexations of reading YB is what one may call the curse of the 
synonym. The availability of several different phrases for the same or a similar mathematical process 
seems occasionally to encourage the use of different, generally but not always synonymous, expressions 
for the same process as well as the use of the same expression for closely related but not identical 
processes. This is not unknown in modern mathematical writing, but can be a source of confusion 
in a mathematical language that dispenses with symbolic notation. One instance concerns the words 
samkalitam and yogam (see [16] for a brief discussion of this and other examples). In most cases, as in 
paragraph: “In this way, if [we] divide all [the terms] by the multiplier ((radius)² (= 
1 in our normalisation); i.e., if $1/d_i^2$ on the right is systematically replaced by the 
whole of the right side), the sequence of corrections (śodhyaparampara) will not 
end until [we] divide by the divisor (i.e., $d_i^2$). If [we] do not divide by the divisor, the 
sequence of corrections will not end. [All we can do is to] discard it (the correction 
term) when it becomes very small” ([Sarma] 6.3.3). 
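
The recursive identity for 1/a, and the effect of stopping it at the kth stage, can be verified with exact rational arithmetic. The Python sketch below is a modern illustration, not a rendering of YB's procedure; the function names and the sample values (i = 3, n = 5, so that $a = 1 + 9/25$ and b = 1) are assumptions made here.

```python
from fractions import Fraction


def recursive_identity(a, b, k):
    """Split 1/a into the first k correction terms and the exact remainder.

    partial   = 1/b - (a-b)/b^2 + ... (k terms in all)
    remainder = (-1)^k ((a-b)/b)^k * (1/a), still containing the unknown 1/a
    """
    r = (a - b) / b
    partial = sum((-r) ** j / b for j in range(k))
    remainder = (-r) ** k / a
    return partial, remainder


def kth_samskaram(a, b, k):
    """Approximate 1/a by putting a = b in the last factor of the last term."""
    r = (a - b) / b
    partial, _ = recursive_identity(a, b, k)
    return partial + (-r) ** k / b


# Hypothetical sample values: i = 3, n = 5, so a = d_i^2 = 1 + 9/25 and b = 1.
a, b = 1 + Fraction(9, 25), Fraction(1)
for k in (1, 2, 4, 8):
    partial, remainder = recursive_identity(a, b, k)
    print(k, partial + remainder == 1 / a, float(kth_samskaram(a, b, k) - 1 / a))
```

The identity holds exactly at every stage, whereas the truncated value improves with k only because (a − b)/b is less than 1 here.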

Apart from a precise formulation of how an infinite series is recursively gen- 
erated — it will serve as a template for more complicated series later — there is 
enough in this passage to clarify several issues. First is the virtue of displaying 
the unknown answer in a way enabling repeated substitutive recursion ad infinitum. 
The demands of practical utility (vyavaharam) are met by the suggestion to 
terminate the sequence when it is deemed sufficiently accurate. But what stands 
out most is the dramatically repeated message that there is no need to end the 
sequence, that the exact measure of the circumference, in other words the value of π, 
can only be given by an infinite series. The message is reinforced in the opening 
lines of the following paragraph (see section 6.3.4 of [Sarma]) describing the even 
and odd terms with their alternating signs whose sum, in the limit $n \to \infty$, will 
become the exact circumference.³⁴ And it is difficult to read this passage without 
calling to mind how Nilakanthan interpreted Aryabhata's description of his 
own fractional value for π as asanna (approximate), that it really means that π is 
irrational.³⁵ 

All this leads directly to the question of convergence, particularly easy to deal 
with for the arctangent series because the recursive identity, infinitely iterated, 
results in a geometric series: 


$$\frac{1}{a} = \frac{1}{b} - \frac{a-b}{b^2} + \frac{(a-b)^2}{b^3} - \cdots$$

with −(a − b)/b as the common ratio. So, while the identity is trivially true at every 
stage of iteration for any values of a and b, the infinite series converges if and only 
if a − b < b (in the present instance i < n which is guaranteed; what does not work 
in the second octant is the geometric construction, as made clear by YB). Let us 
recall that it is in dealing with similar situations requiring the integration of functions 
which are not powers that Newton took recourse to formal binomial expansions. The 
binomial series for the inverse first power, which too is a geometric series, coincides 
with the iterative series, but a binomial expansion for fractional or general negative 
integral exponents seems to have had no use for the Kerala mathematicians. But 


the present one, the context helps to remove any ambiguity. The Malayalam-to-English glossary at the 
end of [TA] translates śodhyaphalam as “correction to be applied to a result”. 

34 It is this kind of insight that we are deprived of in the Sanskrit verse texts, see footnote 30. 

35 I have suggested elsewhere [16] that the irrationality of π may have acted as an immediate motivator 
for the development of the π series. 
if the need had arisen, they probably had other effective techniques to hand, for 
instance a form of iterative approximation that appears to have been used already 
in the Śulbasūtras and subsequently in the Bakhshali manuscript to extract square 
roots. There are interesting recursive issues relating to these formulae which I take 
up briefly in section 4.6 where we will see that the Śulbasūtra formula may well be 
the first recorded instance of the use, in an arithmetical computation, of the general 
method of samskaram. 


3.4. Recursion as Proof, Induction 


Having shown that $S''_n - S_n \to 0$ as $n \to \infty$, YB proceeds to demonstrate how 
$S = S'' := \lim_{n\to\infty} S''_n$ can be evaluated. This is done by taking the limit of each 
term in the sum 

$$S''_n = I_{n,0} - I_{n,2} + I_{n,4} - \cdots$$

with 

$$I_{n,k} := \frac{1}{n^{k+1}}\sum_{i=1}^{n} i^k$$

for k = 0, 2, ···. The computation of $I_k := \lim_{n\to\infty} I_{n,k}$, in terms of which the 
circumference is finally expressed: 

$$\frac{\pi}{4} = I_0 - I_2 + I_4 - \cdots$$

is explained in elaborate detail (sections 6.1 - 6.4 of [Sarma]) in what is probably 
one of the best organised pieces of exposition in the book. The contents are worthy 
of the care taken. Firstly, it encompasses the first results that can be considered 
‘pure calculus’, answers to questions which had never been posed before, namely 
the values of the definite integrals 

$$I_k = \int_0^1 x^k\,dx,$$

for all (not just the even) nonnegative integral k, formulating along the way the 
principle of integration by parts. (Their generalisation to an arbitrary upper limit 
is addressed later by means of a simple geometrical argument). More interestingly 
from the perspective of this article, the integrals are evaluated by a method which, 
except for the language, is indistinguishable from a modern inductive proof: the 
value of $I_k$ is expressed in terms of $I_{k-1}$ and so on till $I_0 = 1$ is reached. After 
a prefatory sentence: “Even though it is not useful here, [I am] describing also 
integrals³⁶ of equals multiplied three, five, etc. times as they occur in the midst of 
those which are useful” ([Sarma] 6.4), YB first evaluates $I_1$ from $I_0$ (= 1) and $I_2$ 
from $I_1$ in detail, then $I_3$ from $I_2$ and $I_4$ from $I_3$ progressively more briefly, “by 
virtue of the reasoning given earlier”. A final passage gives the general rule: “To 
make integrals of higher and higher powers, multiply the particular integral by the 
radius and remove from it itself divided by the number which is one greater (for 
unit radius, this amounts to $I_{k+1} = I_k - I_k/(k+2)$)” ([Sarma] 6.4.4). However, in 
order to highlight the recursive content, I compress all of this³⁷ into a description of 
the general case of evaluating $I_{k+1}$ from $I_k$, following faithfully YB's own steps in 
going from $I_0$ to $I_1$ to $I_2$. 

The prior announcement about the need to consider odd values of k, “though 
not useful here", already offers a hint that it was an essential step in the proof and 
that the proof will turn out to be inductive. The details bear this out. True to its 
invariable practice, YB carries out as much of the procedure as is practicable for 
finite n, before taking the limit. In doing this it is a simplification to drop the powers 
of n from the definition of the integrals (samkalitam) and to focus on the sums of 
powers: 

$$A_{n,k} := \sum_{i=1}^{n} i^k.$$

The first step in evaluating $A_{n,k}$ is none other than the first stage in a general process 
of samskaram as I have described it in section 3.1. One makes an insightful first 
guess $B_{n,k}$ by replacing one factor of i in each term by n: 

$$B_{n,k} := n\sum_{i=1}^{n} i^{k-1} = nA_{n,k-1}.$$

The ‘error’ is 

$$B_{n,k} - A_{n,k} = \sum_{i=1}^{n-1} (n-i)\,i^{k-1}.$$


36 The word employed is samkalitam, which is used in this part of the book for both $I_{n,k}$ and the 
limiting quantities $I_k$; see footnote 33. 

37 The thought naturally arises that Jyesthadevan was aware that this was an unprecedented sort of 
yukti and so was presenting his reasoning with the care and attention to detail that it called for. On the 
value placed by the Kerala masters on consensus as a means of validating a yukti, on convincing their 
peers and disciples of its correctness, see [3]. 
A simple reordering of the sum on the right side now leads to the identity 

$$\sum_{i=1}^{n-1} (n-i)\,i^{k-1} = \sum_{i=1}^{n-1}\sum_{j=1}^{i} j^{k-1} = \sum_{i=1}^{n-1} A_{i,k-1},$$

so that the difference between the exact value and the first guess can be expressed 
as 

$$B_{n,k} - A_{n,k} = \sum_{i=1}^{n-1} A_{i,k-1}.$$

So 

$$A_{n,k} = nA_{n,k-1} - \sum_{i=1}^{n-1} A_{i,k-1}.$$


This is the key relation. In contrast to the application of the elementary recursive 
identity, however, $B_{n,k} = nA_{n,k-1}$ becomes a ‘known quantity’ only after the sum 
of the (k − 1)th powers is computed. Moreover, the difference does not display $A_{n,k}$ 
itself in a nice form amenable to a simple iteration but only as a linear combination of 
sums of the (k − 1)th powers of numbers up to n − 1. A simple iterative computation 
of $A_{n,k}$, and hence of $I_{n,k}$ and $I_k$, is therefore out of the question. But it opens 
the door for an inductive evaluation since it allows $I_k$ to be related eventually to 
$I_{k-1}$. A second problem is that combinatorial complications make it difficult to 
handle $A_{n,k}$ themselves for any k but the lowest few.³⁸ So at this point, YB takes 
n to be asymptotically large before taking the inductive step, even for k = 1, 2, 3. 
Accordingly, it begins with $A_{n,0} = n \sim n$ (∼ denotes the asymptotically dominant 
term) to show that $A_{n,1} \sim n^2/2$. We can save some writing, without in any way 
detracting from YB's basic method, by short-circuiting its careful progress through 
k = 2 and k = 3 and dealing with the case of general k as a modern inductive proof 
would. Assume therefore that $A_{n,l} \sim n^{l+1}/(l+1)$ for all l < k + 1. Then in the key 
relation for $A_{n,k+1}$, we have $B_{n,k+1} \sim n^{k+2}/(k+1)$ and, for the error term, 

$$\sum_{i=1}^{n-1} A_{i,k} \sim \sum_{i=1}^{n-1} \frac{i^{k+1}}{k+1} = \frac{1}{k+1}\,A_{n-1,k+1}.$$


38 $A_{n,0} = n$. The values of $A_{n,1}$, $A_{n,2}$ and $A_{n,3}$ had long been known. For general k, the $A_{n,k}$ 
are expressible in terms of Bernoulli numbers, but there is no evidence that the Kerala school had exact 
expressions for them. 
For $n \to \infty$, $A_{n-1,k+1}$ tends to $A_{n,k+1}$; so putting it all together, we arrive at 

$$A_{n,k+1} \sim \frac{n^{k+2}}{k+1} - \frac{1}{k+1}\,A_{n,k+1}.$$

Or, 

$$A_{n,k+1} \sim \frac{n^{k+2}}{k+2}.$$

By restoring the powers of n in the definition of $I_{n,k}$, it is easy to rewrite this in the 
form $I_k = kI_{k-1}/(k+1)$, i.e., $I_k = 1/(k+1)$. 
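
Both the exact key relation and its asymptotic consequence can be checked directly. The Python sketch below is a modern illustration under the definitions $A_{n,k} = \sum_{i=1}^{n} i^k$ and $I_{n,k} = A_{n,k}/n^{k+1}$ used above; the function names are hypothetical and the code makes no claim to mirror YB's own steps.

```python
from fractions import Fraction


def power_sum(n, k):
    """A_{n,k}: the sum of the kth powers of 1, 2, ..., n."""
    return sum(i ** k for i in range(1, n + 1))


def key_relation_holds(n, k):
    """Check A_{n,k} = n*A_{n,k-1} - sum_{i=1}^{n-1} A_{i,k-1} exactly."""
    first_guess = n * power_sum(n, k - 1)                     # B_{n,k}
    error = sum(power_sum(i, k - 1) for i in range(1, n))     # B_{n,k} - A_{n,k}
    return power_sum(n, k) == first_guess - error


def scaled_integral(n, k):
    """I_{n,k} = A_{n,k} / n^(k+1), which tends to 1/(k+1) as n grows."""
    return Fraction(power_sum(n, k), n ** (k + 1))


for k in range(5):
    print(k, float(scaled_integral(2000, k)), 1 / (k + 1))
```

For finite n the scaled sums exceed 1/(k + 1) by roughly 1/2n when k ≥ 1, a term that disappears in the limit.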

Once again, there are methodological insights to be drawn from this first-ever 
evaluation of integrals of arbitrary positive integral powers. To begin with, there is 
the reliance on an enlightened first guess as an almost automatic initial step and on 
its subsequent refining. But instead of leading to a simple iteration, the purpose it 
serves here is as the starting point of a procedure that ends in relating a quantity 
indexed by k, $I_k$, to $I_{k-1}$, in other words a genuinely inductive procedure. We have a 
hint that this is how the calculation of $I_k$ was meant to go in the deliberate foregoing 
of the use of the known values of $I_{n,k}$ to evaluate $I_k$ for k up to 3 (even the sum $A_{n,1}$ of 
the arithmetic series of natural numbers is not cited); the yukti is designed to cover 
all k as the passages cited earlier suggest. The hint is reinforced by the terminology. 
The integral $I_1$ is called mūlasamkalitam, the root from which the samkalitams of 
all higher powers are generated recursively (except in one place where it is referred 
to as kevalasamkalitam, simple integral, to distinguish it from multiple integrals). 
Related to this is the painstakingly exhaustive exposition of the entire computation, 
letting the student/reader see at every step that the method is logically rigorous: 
there is no guesswork here.³⁹ 

A final insight concerns the reordering (or resummation) identity which I write, 
generalising slightly its form as used in the study of $I_{n,k}$, as 

$$\sum_{i=1}^{n-1} (n-i)\,f_i = \sum_{i=1}^{n-1}\sum_{j=1}^{i} f_j$$


39 Compare Newton on the evaluation of the coefficients in the sine series: “Let it be noted here, by 
the way, that when you know 5 or 6 terms of those roots you will for the most part be able to prolong 
them at will by observing analogies” (De Analysi per Aequationes Numero Terminorum Infinitas in 
[18] Vol.II, p.206). In one of his letters to Oldenburg (for onward transmission to Leibniz), the Epistola 
Posterior [19] Vol.II, p.130, Newton writes about how he “first chanced upon" the general binomial 
series as an extension of the results of Wallis by means of “intercalation” and "interpolation". On the 
question of the nature and role of proof in Indian mathematics, the reader may find it instructive to read 
Srinivas's essay in [Sarma] alongside Roddam Narasimha [3] cited earlier. 
for arbitrary (say real) coefficients $f_i$. (Its elementary proof is the same as in the 
special case $f_i = i^{k-1}$). One recognises it for what it is by passing to the limit, 
rewriting it as an identity for integrals 

$$\int_0^x (x-y)\,f(y)\,dy = \int_0^x dy \int_0^y f(z)\,dz$$

of real functions f. To prove it the modern way, define 

$$g(x) := \int_0^x f(y)\,dy$$

so that, by the fundamental theorem, f(x) = dg(x)/dx. This enables us to integrate 
the second term on the left by parts. The left side is then 

$$x\,g(x) - \int_0^x y\,\frac{dg(y)}{dy}\,dy = x\,g(x) - x\,g(x) + \int_0^x g(y)\,dy = \int_0^x dy \int_0^y f(z)\,dz,$$

which is the right side.⁴⁰ 
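
The finite form of the resummation identity is easy to test for arbitrary coefficients. The Python sketch below is a modern illustration only; the function name `resummation_sides` and the sample coefficients are assumptions made here, with `f[i]` standing for $f_i$ and `f[0]` left unused.

```python
def resummation_sides(f, n):
    """Return both sides of sum_{i<n} (n-i) f_i = sum_{i<n} sum_{j<=i} f_j.

    f is a sequence indexed so that f[i] is the coefficient f_i for
    i = 1, ..., n-1; the entry f[0] is ignored.
    """
    left = sum((n - i) * f[i] for i in range(1, n))
    right = sum(sum(f[j] for j in range(1, i + 1)) for i in range(1, n))
    return left, right


# Hypothetical sample coefficients, chosen arbitrarily (signs included).
coefficients = [0, 3, -1, 4, 1, -5, 9]
left, right = resummation_sides(coefficients, 7)
print(left, right)
```

Each coefficient $f_j$ appears once in every inner sum with i ≥ j, that is n − j times in all, which is the whole content of the reordering.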


3.5. Trial and Error 


Having found an exact expression for π/4 as an infinite series of alternating reciprocals 
of odd integers, YB concludes chapter 6 with a section which Tampuran and 
Ayyar entitle accurately “Refinements needed to lighten (or shorten) the work of 
computing the circumference: their justification (upapatti) and accuracy”.⁴¹ These 
refinements take the form of estimating the remainder after a finite number of terms 
of the series by a trial-and-error procedure of samskaram leading to two different 
outcomes: alternative, faster-converging exact infinite series or approximate but 
extremely accurate estimates for the remainder as low degree rational functions of 
the termination point. True to its general style, YB does not say why it is absolutely 


40 The essential point of this exercise is that the finite version of the fundamental theorem is self- 
evident — the whole is the sum of its parts — and is never mentioned in any form in YB. This is an 
interesting aspect of doing as much work as possible before the limit is taken: it deemphasises the role 
of the fundamental theorem. 

41 In Sarma’s translation, this section (6.8) is headed “Antya-samskara: Final correction terms”. The 
section opens with the sentence: “[I] describe [now] how the results obtained by dividing by higher and 
higher odd numbers are refined to make them come closer to the circumference by performing a final 
samskaram”. The terminal clause can alternatively be read as “. . . by performing a samskaram after the 
last term”. The expression antya-samskara, with its funerary connotation (‘the last rites’) does not occur 
in this section nor, in my reading, anywhere else in the book. What is used here instead of antya (and 
elsewhere in the section to denote the term at which truncation is effected) is an equivalent non-Sanskrit 
Malayalam word having the same literal meaning (‘final’ or ‘in the end’) but not the ritual connotation. 
essential to look for methods of reducing the computational burden, but this is one 
motivation that is easy to guess. The $\pi$ series is unfit for vyavaharam, on account of 
its slow convergence: the first 56 terms add up [TA] to a value which is wrong in the 
third significant figure, much worse as an approximation than Aryabhata's asanna 
rational value which is correct to the fifth significant figure. For an endeavour that 
had its origin in dissatisfaction with rational approximations to $\pi$, this must have 
been galling. 
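
The slowness of convergence is easy to see numerically. The following sketch sums the first 56 terms of the series for $\pi/4$ and compares the result with Aryabhata's rational value 62832/20000; the function name `pi_series` is an illustrative choice.

```python
import math


def pi_series(terms):
    """4 * (1 - 1/3 + 1/5 - ...), truncated after `terms` terms."""
    return 4 * sum((-1) ** k / (2 * k + 1) for k in range(terms))


approx = pi_series(56)
aryabhata = 62832 / 20000   # the asanna value of Ganita 10

print(f"56 terms of the series: {approx:.6f}")
print(f"Aryabhata's value:      {aryabhata:.6f}")
print(f"math.pi:                {math.pi:.6f}")
```

The truncated series misses $\pi$ by roughly the reciprocal of the number of terms, which is why a remainder estimate is so valuable here.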

In every other respect, however, this section is completely out of character. For 
the first and only time in the book, it has a good pedagogical discussion of matters 
that can be considered genuinely algebraic, namely operations on polynomials 
and rational functions. Since that discussion, fleeting as it is, is modelled on arithmetic 
with decimal numbers, it will be given its due briefly in section 5.1. More 
remarkably, after explaining at length the ‘philosophy’ of an ingenious and novel 
semi-empirical technique of samskaram, YB contents itself with listing several results 
the technique is said to lead to by quoting (Sanskrit verses) from TS, without 
proof or any sort of justification: there is no yukti in these passages and little 
bhasa. 

The conspicuous absence of yukti has encouraged some modern commentators 
to attempt to supply one, and some of the work in that direction has led to an 
intriguing situation. For instance, C. T. Rajagopal (one of the pioneer scholars of 
YB) and M. S. Rangachari have pointed out [26] that there is a beautifully recursive 
procedure to evaluate the truncation error exactly as an infinite continued fraction 
whose first few approximants coincide precisely with YB's recipes. So, did 
Madhavan or Nilakanthan use such methods to arrive at their spectacularly effective 
results, methods which YB for reasons unknown decided not to elaborate? Within this 
mystery lies another riddle. Indian mathematics as a whole seems curiously unwilling 
to employ continued fraction expansions, a recursive device par excellence, 
especially as generated by the Euclidean algorithm. (Its rudimentary use in kuttaka 
seems to be the exception that proves the rule.) As of now, there is no satisfactory 
explanation; how these approximations were arrived at remains something of an 
enigma.⁴³ 

The philosophy of the refining process is, briefly, as follows. YB first expresses 
the remainder after summing up to the $i$th term (with denominator $2i - 1 =: j$; so 
$j$ runs through the positive odd integers) as $1/r_j$ with sign opposite to that of the 
last summand ($r_j$ is therefore positive for all $j$). Then the remainder after truncation 


42 The third and final (as far as YB is concerned) approximant for the correction after 56 terms 
improves the accuracy of $\pi$ from the second to the eleventh significant figure [TA]. 

43 Rajagopal and Rangachari are, very properly, cautious on this question: they preface their continued 
fraction solution of the equation satisfied by the remainder with the sentence, “Let us make some 
observations which (in their main part) go beyond the evidence of the surviving Keralese texts but are 
perhaps not irrelevant” [26]. 
one term earlier is given by the recursive equation 

$$\frac{1}{r_{j-2}} + \frac{1}{r_j} = \frac{1}{j}.$$


The basic idea is to think of the sum of the first $(j + 1)/2$ terms of the $\pi$ series 
for a chosen $j$ as the first guess in a process of samskaram and the remainder $1/r_j$ 
as the correction resulting from the would-be samskaram. The equation above for 
the remainders (the error equation) is the condition for the samskaram to give the 
exact value for the remainder and the aim is to use it as a guide to arrive at a ‘good’ 
approximation. After recognising that an exact solution (as a finitely expressed 
function of $j$) is probably impossible, the discussion proceeds to make a patently 
absurd first guess, but now for $r_j$, namely $r_{j-2} = r_j = 2j$, and rejects it. The reason 
given, in a first explicitly functional piece of argument (the notion of a function in 
a geometric setting is implicit in the generalisation of the $\pi$ series to the arctangent 
series, but here there is no longer any geometry to fall back upon), is that the 
remainder after the $j$-term must depend on $j$ “in the same manner” as the remainder 
after the $(j - 2)$-term depends on $j - 2$, i.e., that $r$ must be a (non-constant) function 
of $j$. So YB now offers a revised, functionally consistent guess, $r_j = 2(j + 1) + x$, 
$r_{j-2} = 2(j - 1) + x$, which has the property that $r_j - r_{j-2} = 4$, independently of 
$j$ and $x$. This of course does not solve the error equation but only an approximate 
version of it, which for $x = 0$ reads 

$$\frac{1}{r_{j-2}} + \frac{1}{r_j} = \frac{1}{j - 1/j}.$$

But it is a very good approximation for $j \gg 1/j$: for truncation after 56 terms for 
instance, $j$ is of the order of 100 and the accuracy is roughly 1 part in 10,000. (YB 
actually defines and computes the inaccuracy as the difference $1/r_{j-2} + 1/r_j - 1/j$.) 

The search for even better approximations is now carried further. The variant 
$r_j = 2j + 1$, $r_{j-2} = 2j - 3$ (keeping $r_j - r_{j-2} = 4$; no reason for doing so is given) 
is tried and, once again, rejected since the inaccuracy is now one order in $j$ higher 
(“in the second place in $j$”; the reference is to a place-value notation with variable 
base $j$ for polynomial functions of $j$, see section 5.1). At this point, the account 
gives up on the condition of constant difference between $r_j$ and $r_{j-2}$ and suggests 
solutions of the form $r_j = 2j + 2 + x/(2j + 2)$, $r_{j-2} = 2j - 2 + x/(2j - 2)$, 
settling finally on $x = 4$ as it keeps the error “in the first place in $j$” (“as said by the 
preceptor (acaryan)”, Nilakanthan, Jyesthadevan's guru). The approximations for 
$\pi/4$ resulting from these prescriptions were first written out in [TA] and, in English, 
can be found in [RSS]. 
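
A short numerical sketch may help to fix ideas. It applies two of the guesses for $r_j$ discussed above, $r_j = 2(j+1)$ and $r_j = 2j + 2 + 4/(2j + 2)$, to the sum of the first 56 terms, and also evaluates the inaccuracy $1/r_{j-2} + 1/r_j - 1/j$ for each. The names `r_first`, `r_second`, `corrected` and `inaccuracy` are hypothetical labels introduced here for illustration.

```python
import math


def partial_sum(n):
    """First n terms of 1 - 1/3 + 1/5 - ..."""
    return sum((-1) ** k / (2 * k + 1) for k in range(n))


def corrected(n, r):
    """Partial sum of n terms plus the remainder estimate 1/r(j), j = 2n - 1,
    taken with sign opposite to that of the last summand."""
    j = 2 * n - 1
    sign = -1 if n % 2 == 1 else 1   # last summand has sign (-1)**(n - 1)
    return partial_sum(n) + sign / r(j)


def r_first(j):
    return 2 * (j + 1)


def r_second(j):
    return 2 * j + 2 + 4 / (2 * j + 2)


def inaccuracy(r, j):
    """How far a guess r is from satisfying 1/r(j-2) + 1/r(j) = 1/j."""
    return 1 / r(j - 2) + 1 / r(j) - 1 / j


n = 56
for name, r in (("first", r_first), ("second", r_second)):
    err = abs(4 * corrected(n, r) - math.pi)
    print(name, f"error in pi: {err:.2e}", f"inaccuracy: {inaccuracy(r, 2 * n - 1):.2e}")
```

For the first guess the inaccuracy works out to $1/(j^3 - j)$ and for the second to $-4/(j^5 + 4j)$, which is one way of seeing why the second is so much finer.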

Hereabouts, YB introduces yet another recursive novelty and, simultaneously, 
chooses once again not to remain true to its title by citing several results from 
TS without any yukti accompanying them. The novelty consists in turning a given 
samskaram for the remainders into a reordered infinite series for $\pi$ by substituting 
the chosen correction terms for each of the terms in the original series, i.e., by 
replacing $1/j$ by $1/r_{j-2} + 1/r_j$, resulting in 


If the $r_j$ satisfy the error equation, this is an exact but logically trivial result: 
$\pi/4 = 1 - 1/r_1$; it is just the definition of $r_1$. It becomes interesting only if the $r_j$ 
are approximate and do not satisfy the equation. But then it is no longer clear that the 
series sum up correctly to the value $\pi/4$, though one may suspect, correctly as we 
know now, that they will if the $r_j$ are sufficiently fine (suksmam) approximations. 
The series resulting from various choices for $r_j$ (see [TA], [26] or [RSS]) are quite 
remarkable and some of them have been linked, via continued fractions, to certain 
results of Euler and of Ramanujan [26]. 
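
As one concrete instance, the choice $r_j = 2(j + 1)$ leads to a series commonly attributed to the Kerala school, $\pi/4 = 3/4 + 1/(3^3 - 3) - 1/(5^3 - 5) + 1/(7^3 - 7) - \cdots$, whose terms are the inaccuracies $1/(j^3 - j)$ of that guess. The sketch below only checks the numerical behaviour of this series against the original one; the bookkeeping is a modern paraphrase and the function names are illustrative.

```python
import math


def original_series(terms):
    """1 - 1/3 + 1/5 - ..., truncated after `terms` terms."""
    return sum((-1) ** k / (2 * k + 1) for k in range(terms))


def transformed_series(terms):
    """3/4 + 1/(3**3 - 3) - 1/(5**3 - 5) + ..., with `terms` terms after 3/4."""
    total = 0.75
    for k in range(1, terms + 1):
        j = 2 * k + 1                       # odd integers 3, 5, 7, ...
        total += (-1) ** (k + 1) / (j ** 3 - j)
    return total


for terms in (5, 20):
    e_old = abs(4 * original_series(terms) - math.pi)
    e_new = abs(4 * transformed_series(terms) - math.pi)
    print(terms, f"original: {e_old:.2e}", f"transformed: {e_new:.2e}")
```

The transformed terms fall off like $1/j^3$ rather than $1/j$, so a handful of them already do better than dozens of terms of the original series.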

From the recursive perspective, this approach is an almost perfect illustration of 
the general philosophy of samskaram: one begins with an identity, namely the series 
for $\pi$ in terms of $r_j$, then approximates each $r_j$ in such a way that the errors are 
always of order higher than $j$ and will (hopefully, since the procedure is very formal) 
cancel out in the end. Indeed, by the time chapter 6 is brought to a conclusion, we 
have been exposed to a whole sequence of samskarams, nested one within another, 
of varying degrees of suksmata. The concluding passage is yet another verse from 
TS, containing yet another approximation, finer by one degree in $j$ (the third 
approximant in the continued fraction expansion [26] of $1/r_j$), leaving the question 
of where it all really came from tantalisingly open. 


4. Varieties of Recursion — the Sine/Cosine Series 
4.1. From Aryabhata to Madhavan 


If the $\pi$ series and the arctangent series are the logical culmination of attempts to 
improve upon Aryabhata's approximate rational value for $\pi$, the sine and cosine 
series are, in the same measure, the final goal of the search that began with his sine 
table. Already in AB, the two problems are closely linked: without a sufficiently 
precise value for $\pi$, a table of sines accurate to one minute of arc could not have been 
made, no matter how finely the circle is divided. Indeed, the two verses setting out the 
general idea and the recipe for the computation of the sine differences (Ganitapada 
11 and 12) are immediately preceded by the $\pi$ stanza. And, in going beyond 
Aryabhata's level of precision, there would have been little point in Madhavan's 
fine elaboration of a theory of trigonometric functions without a correspondingly 
fine and precise value for $\pi$. From these two interlinked perspectives therefore, the 
parallel development of the $\pi$ series and the sine series in YB is very natural. 
The surprise then is that the breakthrough took so long. The two Bhaskaras, 
both devoted Aryabhata followers, seem to have missed the deeper meaning of 
Ganitapada 11 instructing the listener (or reader) to divide the circumference so 
as to make it possible to find as many sines of equal arcs as one likes (yathesta). 
Whatever the reason for this oversight, and that is another story, it took until the 
advent of Madhavan for Aryabhata’s vision to be finally realised. 

The accurate/exact evaluation of $\pi$ having been dealt with in chapter 6, the first 
half of chapter 7 of YB (up to and including section 7.8 of [Sarma]) is concerned 
wholly with the circle of ideas pertaining to the accurate/exact evaluation of $\sin\theta$ 
for an arbitrary value of $\theta$. Quite remarkably, the first topic addressed is the original 
Aryabhatan numerical table of sine differences (of limited accuracy, necessarily) 
for values of $\theta = m\pi/48$, $m = 0, 1, \ldots, 24$, covering the first quadrant. (I shall 
use $\epsilon$ to denote the angle $\pi/48$, $s_m$ for $\sin m\epsilon$, $\delta s_m$ for the difference $s_m - s_{m-1}$ and 
$\delta^2 s_m$ for the second difference $\delta s_{m+1} - \delta s_m = s_{m+1} + s_{m-1} - 2s_m$.) One might have 
thought that this was wasted labour for the author, knowing as he did that an infinitely 
accurate procedure which worked for any angle was soon going to follow. Apart 
from showing reverence for knowledge from earlier times (“purvasastra”, meaning 
really AB), and familiarising the reader with the algebraic properties of differences 
and second differences of functions in a familiar situation, this may have had a 
practical motivation: the computation of the table using Aryabhata's formula or the 
equivalent procedure given in YB takes much less effort than working with power 
series. In fact the section of YB containing the yukti behind the table is followed 
by the description of Madhavan’s interpolation formulae for sines and cosines of 
angles lying between two tabulated angles making it clear that, for purposes of 
astronomical calculations, there was no need to go any further and also that the sine 
table was an essential first step. We are once again made aware that the development 
of the power series owed less to practical needs than to the attraction of a problem 
that was there and that could be solved. 

In any case, from the point of view of this article, the interest is in the fact that all 
three phases of that enquiry, tabulation, interpolation and power series expansion, 
are addressed by recursive methods of increasing subtlety. 


4.2. Making the Sine Table 


One point of departure for Aryabhata's sine table (the expression will stand for a 
table of either the sine differences or the sines unless explicitly distinguished) is the 
value of $\sin \pi/6$; this is attested by (the second line of) Ganita 9, to the effect that 
the side of a regular hexagon is equal to the radius of its circumscribing circle. Thus 
the Ganita stanzas 9 to 12 take us along one logical sequence of steps leading to the 
table, beginning with $\sin \pi/6 = 1/2$, through the approximate value of $\pi$ (3.1416) 
and the somewhat enigmatic (at first reading) verse 11 instructing us to divide the 
circumference into as many equal arcs as we please, and ending with the formula for 
$\delta s_m$. In view of the content of verse 11, it is pertinent to wonder why this particular 
subdivision was chosen and why it stopped at $\pi/48$. As often suggested (first by 
Playfair [20] on the basis of his acquaintance with Suryasiddhanta), it would have 
been straightforward to compute $\sin \pi/48$ by halving $\pi/6$ successively three times 
and using the theorem of the diagonal to relate the chord of an arc to the chord 
of half that arc, a relation almost certainly known to Aryabhata; YB explains the 
geometry and gives the formula at the start of chapter 7, sections 7.2.1 and 7.2.2 of 
[Sarma]. Indeed YB concludes this part by stating that the sines of some of the 24 
Aryabhatan angles can be obtained in this way while “some others” (effectively all 
the rest) can be found by starting with $\sin \pi/4$ and similarly subdividing. The whole 
table could thus have been made by direct computation using a formula for $\sin \theta/2$ 
that Aryabhata knew. Given this fact, it is even more pertinent to wonder why his 
table as well as his recipe for making it deals only with sine differences, not the 
sines themselves. The germ of a plausible answer may lie in a second observation 
of Playfair's, that $\pi/48$ is the largest angle in this sequence for which the chord is 
equal to the arc to a precision of one minute. 

A table of sines is a staple of Indian astronomical texts, right until the time of 
YB and beyond, as it was an essential aid to the computation of the true positions of 
heavenly bodies. In fact, by around the 9th century, the table acquired a canonical 
katapaya form and, at least in Kerala, seems to have been just transcribed from 
text to text. From very early times, what is most often tabulated are the sines, 
not their differences (presumably since those are what is directly employed in 
planetary calculations) but the formula itself, when it is stated, is always for the 
sine differences. Already in Suryasiddhanta, we find the table presented (in stanzas 
II.17-22 in the version translated and edited by Burgess [27]) directly for the sines 
while the formula (in II.16) is for their differences⁴⁴: 


$$s_m - s_{m-1} = s_1 - \frac{1}{s_1}\,(s_1 + s_2 + \cdots + s_{m-1}).$$


In all essentials this is the same as the more ambiguously expressed formula of 
AB as interpreted by K.S. Shukla in his annotated translation [28]. Shukla in fact 
gives several equivalent rewritings of the formula, due to various traditional commentators 
as they came to grips, each in his own way, with Ganita 12. All of them 
employ Aryabhatan units, namely minutes, for both an angle $\theta$ (equivalently the 
arc subtending that angle) and its sine (equivalently the half-chord of arc $2\theta$ of 
the standard circle of circumference 21600 minutes and radius $21600/2\pi = 3438$ 


44 In his commentary on these stanzas Burgess expresses the curious view that the table was prepared 
not by using the formula but by direct subdivision, the formula being obtained empirically by a subsequent 
inspection of the table. Why would anyone first state a rule that was not going to be used and then use a 
different rule, unstated, to get the same results? 






minutes).⁴⁵ In these units and to an accuracy of 1 minute, $\delta s_1 = \sin\epsilon = \epsilon = 225$, 
$\delta s_2 = \sin 2\epsilon - \sin\epsilon = 224$ and so $\delta^2 s_1 = -1$. Consequently, a multiplicative $\delta^2 s_1$ 
disappears from all formulae. All of the early commentators except one incorporate 
the resulting numerical simplification (a minor mischief which is easily set right by 
restoring the exact formula for $\delta^2 s_1$) as well as the approximation $\sin\epsilon = \epsilon$ in their 
versions of the sine-difference formula. 
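
The numbers quoted above are easily reproduced. The sketch below works in minutes on a circle of circumference 21600, computes the first two sine differences with modern floating-point sines, and rounds to the nearest minute; the variable names are illustrative.

```python
import math

R = 21600 / (2 * math.pi)   # radius of the standard circle, in minutes
eps = math.pi / 48          # the Aryabhatan arc, in radians

s1 = R * math.sin(eps)      # first tabulated sine (half-chord), in minutes
s2 = R * math.sin(2 * eps)

d1 = s1                     # delta s_1 = s_1 - s_0 with s_0 = 0
d2 = s2 - s1                # delta s_2
dd1 = d2 - d1               # second difference delta^2 s_1

print(round(R), round(d1), round(d2), round(dd1))
print(f"unrounded second difference: {dd1:.3f}")
```

The unrounded second difference is slightly smaller than 1 in magnitude, which is the sense in which replacing it by $-1$ is a numerical simplification rather than an exact statement.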

The exception is Nilakanthan. Reading more into Ganita 12 than is literally 
there, he interprets the verse in a form free from conventions and approximations 
as the exact equation⁴⁶ 

$$\delta s_{m+1} = \delta s_m - \frac{s_m}{s_1}\,(\delta s_1 - \delta s_2)$$

which, for later reference, I rewrite in the explicit second-difference form 

$$\delta^2 s_m = \frac{s_m}{s_1}\,\delta^2 s_1.$$


Nilakanthan has himself given a geometric proof of his formula which can be 
found, in modern notation, in Shukla [28] along with an equivalent trigonometric 
formulation. I recall that proof here after highlighting what appears to have been, at 
least by the time of the Kerala school but possibly from much earlier, the ‘conducting 
thread' leading to the consideration of differences and second differences of sines. 


45 A tribute is due here to one of the less celebrated innovations of Aryabhata, the recourse to a 
natural, angular unit to measure chords in terms of the fixed angular ‘size’ of the circumference. This 
is the converse of the current practice of measuring angles in radians and defining the sine as the 
corresponding half-chord in a circle of unit radius and is strictly equivalent to it; in particular the use of 
modern sines etc. in the present work in place of the traditional Rsines etc. is in no respect misleading. It 
is in measuring chords in minutes that the value of $\pi$ comes into play just as we need $\pi$ to calculate the 
circumference of the ‘unit circle’. 

46 It is not difficult to see in this exact generalisation of the original approximate rule a parallel with the 
other instance in which Nilakanthan has read Aryabhata’s mind, when he takes the one word asanna to 
mean that $\pi$ is irrational. Surprisingly, neither of these instances of Nilakanthan’s insight finds a mention 
in YB, surprising in the first case because it is uncharacteristic of YB not to state a general rule when it is 
available and, in the other, in view of the thoroughness of YB's treatment of the complex of issues relating 
to the $\pi$ series. Besides, how can it be that the author of YB, a work composed to provide an exposition of 
the new mathematics “according to TS” as announced in its very first sentence, is unaware or negligent 
of such illuminating wisdom in the teaching of his own direct preceptor? A possible answer may lie in 
the fact that both these examples of mind-reading occur in Aryabhatiyabhasya which Nilakanthan wrote 
late in life and in the possibility that this work was composed after YB. Such a resolution is of some 
chronological interest. K.V.Sarma’s earliest date for YB is around 1530, by which time Nilakanthan 
would have reached the ripe old age of 85 or so. Knowing that he wrote at least two other works 
after his commentary on Aryabhata (Sarma’s introduction to [1]), one of them (the vyakhya of his own 
Siddhantadarpana) of major epistemological importance, it is not unreasonable to assign to it a date 
well before 1530. Correspondingly, if this work postdates YB as suggested here, 1530 will be more like 
a strong upper limit date for YB. 
That thread starts with the realisation of the inadequacy of the rule of three 
(trairasikam) when dealing with quantities whose mutual dependence is not linear, 
in the present instance the ‘sides’ of ‘similar triangles’ when one ‘side’ is an arc 
rather than a chord of the circle. Just before the description of the computation of 
the sine table (at the end of section 7.4.1 of [Sarma]), YB has a brief but enlightening 
passage on the error of relying on the rule of three to determine chords from arcs. 
After warning that it will lead to gross (sthulam) results, it ends with an explanation 
and an exhortation: 


The reason for this [grossness is as follows]. The second arc ($2\epsilon$ in our notation) 
[is] twofold the initial arc, the third arc threefold. Thus the arcs. The second chord 
is not twofold the initial chord, the third chord is not threefold, and so on. The 
reason for this [is as follows]. The initial arc has no curvature [and is] almost equal 
to the chord since the saram (the versine, $1 - \cos\epsilon$; see figure 7.2 of [RSS] for the 
terminology) is small. . . . So do not do trairasikam with respect to the arc because 
the result will be gross. 


In effect we are told that $\sin\theta$ is not a linear function of $\theta$ since the relation between 
the arc and the chord is not one of proportionality. In particular, $s_m$ is not the mean of 
$s_{m+1}$ and $s_{m-1}$ and the difference $\delta s_m$ is not constant. The grossness in the value of 
$s_m$ that will result from doing trairasikam is thus measured by the difference between 
$s_{m+1} + s_{m-1}$ and $2s_m$ or equivalently between $\delta s_{m+1}$ and $\delta s_m$, in other words the 
second difference. The line of thinking here is evidently very similar to what we 
have already met in other varieties of samskaram: the gross first guess is that the 
sine can be linearly interpolated, i.e., that the difference $\delta s_m$ is independent of $m$, 
and the next step is to evaluate the deviation from linearity as the second difference. 

Nilakanthan does this geometrically by considering two astutely chosen pairs of 
similar right triangles (Shukla's trigonometric proof uses the addition formula for 
$\sin(m + 1)\epsilon$ and $\sin(m - 1)\epsilon$) leading to 

$$\delta^2 s_m = 2 s_m (\cos\epsilon - 1).$$


Nothing is more natural now than to eliminate $\cos\epsilon - 1$ in favour of $\delta^2 s_1$ and to 
arrive at the formula written down earlier; Nilakanthan's equation is thus exact. If 
the initial values $s_1$ and $s_2$ (and hence $\delta^2 s_1 = s_2 - 2s_1$) are also exactly known, the 
solution will give the exact values of all $\delta^2 s_m$, all $\delta s_m$ and all $s_m$ recursively. Such an 
exact solution will satisfy all the symmetries of the sine function if the computation 
is carried beyond the first quadrant: $s_{24+i} = s_{24-i}$ etc. and be periodic: $s_{97} = s_1$, etc. 
Here then is the quintessential illustration of what I have called cyclic recursion in 
section 1.7. 
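
The recursion can be tried out directly. In the sketch below the exact starting values $s_1$ and $s_2$ are supplied by the library sine (an assumption made purely for convenience), and every further entry is generated from the second-difference equation $\delta^2 s_m = (s_m/s_1)\,\delta^2 s_1$ alone; the function name `sine_table` is illustrative.

```python
import math


def sine_table(eps, count):
    """Return s[0..count], with s[m] intended to equal sin(m * eps).

    Only s[1] and s[2] are taken from math.sin; all later entries come from
    s[m+1] = 2*s[m] - s[m-1] + (s[m] / s[1]) * dd1,
    where dd1 = s[2] - 2*s[1] is the second difference at m = 1.
    """
    s = [0.0, math.sin(eps), math.sin(2 * eps)]
    dd1 = s[2] - 2 * s[1]
    for m in range(2, count):
        s.append(2 * s[m] - s[m - 1] + (s[m] / s[1]) * dd1)
    return s


eps = math.pi / 48
s = sine_table(eps, 97)

worst = max(abs(s[m] - math.sin(m * eps)) for m in range(len(s)))
print(f"largest deviation from math.sin over m = 0..97: {worst:.1e}")
print(f"s[24] = {s[24]:.12f}, s[97] - s[1] = {s[97] - s[1]:.1e}")
```

Carried through a full circle of 96 steps, the table closes on itself up to rounding error, which is the cyclic behaviour referred to above.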

Thus Nilakanthan. As for Aryabhata, while he may well have been led to second 
differences by the failure of the rule of three, it appears doubtful that he arrived at 
his formula along this precise path. Apart from the evidence of the actual words 
of Ganita 12, the construction invoked by Nilakanthan was probably unknown 
to Aryabhata, being (along with several variants) a characteristic of the Kerala 
approach to circle geometry.⁴⁷ Every piece of writing bearing on this question 
points to Aryabhata taking recourse, from the beginning, to the approximation 
$s_1 = \epsilon$ along with the known formula for doubling or halving the angle to evaluate 
$\delta^2 s_1$ as 

$$\delta^2 s_1 = -2 s_1 \cdot 2\sin^2\frac{\epsilon}{2} = -s_1 \epsilon^2$$


and proceeding thence recursively. More generally, Nilakanthan’s equation will 
take the corresponding approximate form 

$$\delta^2 s_m = -s_m\,\epsilon^2.$$


There is no need to underline the enormous significance of this equation. As $\epsilon$ is 
further subdivided into arbitrarily (yathestam) small pieces (“atomised”), the approximation 
will tend to exactness, the difference equation to the correct differential 
equation for the sine and the two initial values $\sin 0 = 0$ and $\sin\epsilon = \epsilon$ to the appropriate 
initial conditions for its solution. Solving the equation also transforms into a 
subtle iterative process, yet another productive variant of the method of samskaram. 
This is one of the ways in which the sine and the cosine series were finally nailed 
down by Madhavan. YB's own treatment of the table is meticulously adapted from 
the start to a general division of the quadrant and to the eventual goal of making 
each arc segment infinitesimally small as we will see in section 4.4. 
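
The tendency to exactness under finer subdivision can be illustrated numerically. The sketch below divides an arc $\theta$ into $n$ equal pieces of size $\epsilon = \theta/n$ and iterates the approximate equation $\delta^2 s_m = -s_m \epsilon^2$ from the starting values $s_0 = 0$, $s_1 = \epsilon$. It is a modern paraphrase, not a reconstruction of Madhavan's procedure, and the function name is illustrative.

```python
import math


def sine_by_differences(theta, n):
    """Approximate sin(theta) from s[m+1] = 2*s[m] - s[m-1] - eps**2 * s[m],
    with eps = theta / n, s[0] = 0 and s[1] = eps."""
    eps = theta / n
    prev, cur = 0.0, eps
    for _ in range(n - 1):
        prev, cur = cur, 2 * cur - prev - eps ** 2 * cur
    return cur


theta = math.pi / 3
for n in (10, 100, 1000):
    err = abs(sine_by_differences(theta, n) - math.sin(theta))
    print(n, f"{err:.2e}")
```

The error shrinks roughly in proportion to $\epsilon^2$ as the division is refined.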


4.3. The Interpolation Formula - Simultaneous Samskaram 


The description of the sine table is immediately followed in YB by a section outlining 
how to use the tabulated values to estimate $\sin\theta$ and $\cos\theta$ when $\theta$ lies “in the gap” 
between two consecutive tabulated angles. There are hints that this topic may have 
been considered a distraction: it is brief to the point, almost, of impatience; the 
whole passage is two paragraphs and less than one printed page ([Sarma], section 
7.4.3) long. The operative paragraph sketches a geometrical construction in terms 
of the inevitable pair of similar right triangles and indicates (barely) what to do with 
them in order to obtain a second order interpolation formula. The reader will find 
the geometry in Fig. 7.6 of [Sarma] and an explanation of the derivation in [RSS]. 


47 Shukla’s use of the addition formula is of course unnecessary and may be misleading. While 
the sine difference formula follows directly from the addition formula, the reverse implication is less 
direct. In any case, Aryabhata almost certainly was not in possession of the general addition formula. 
The distinction of its discovery also belongs to Madhavan, going by the laudatory references to his 
jiveparaspara-nyayam (literally, the principle of adjacent chords; more informatively, the principle of 
the half-chords of adjacent arcs) and its heavy use by his followers (including in YB). 
For an incremental angle $\delta := \theta - m\epsilon < \epsilon$, the formulae are 

$$\sin(m\epsilon + \delta) = \sin m\epsilon + \delta \cos m\epsilon - \frac{\delta^2}{2} \sin m\epsilon,$$

$$\cos(m\epsilon + \delta) = \cos m\epsilon - \delta \sin m\epsilon - \frac{\delta^2}{2} \cos m\epsilon$$


and they depend essentially on the Aryabhatan approximation of $\sin\delta/2$ by $\delta/2$. 
The section ends with “If this is not of sufficient accuracy, take the full chord of one 
quarter of the residual arc (i.e., $\delta/4$ in place of $\delta/2$) . . . if this is also not sufficient, 
take half of that . . .” followed by a reference to a verse in TS. The paragraph does 
include the word samskaram but that is about all. 
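
How good the second order formulae are in practice can be gauged with a small computation. The sketch assumes the interpolation rule in the form $\sin(m\epsilon + \delta) \approx \sin m\epsilon + \delta \cos m\epsilon - (\delta^2/2) \sin m\epsilon$, with the companion rule for the cosine, and takes the tabulated values from the library functions; `interp_sin` and `interp_cos` are illustrative names.

```python
import math


def interp_sin(theta, delta):
    """Second order estimate of sin(theta + delta) from values at theta."""
    return math.sin(theta) + delta * math.cos(theta) - 0.5 * delta ** 2 * math.sin(theta)


def interp_cos(theta, delta):
    """Second order estimate of cos(theta + delta) from values at theta."""
    return math.cos(theta) - delta * math.sin(theta) - 0.5 * delta ** 2 * math.cos(theta)


eps = math.pi / 48
theta = 10 * eps        # a tabulated angle, m = 10
delta = eps / 2         # midway to the next tabulated angle

err_sin = abs(interp_sin(theta, delta) - math.sin(theta + delta))
err_cos = abs(interp_cos(theta, delta) - math.cos(theta + delta))
print(f"sine error: {err_sin:.2e}, cosine error: {err_cos:.2e}")
```

On a circle of radius 3438 minutes, errors of this size are a small fraction of a minute, which fits the remark that for astronomical purposes nothing further was needed.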

In fact the reasoning behind the interpolation is interestingly recursive. The 
sentence cited above asks us to take the computed approximate value of $\sin(m\epsilon + \delta/2)$ 
as the new input and, proceeding as before, to calculate the new output as 
$\sin((m\epsilon + \delta/2) + \delta/2)$; obviously the process can be continued indefinitely. But it is to 
be stressed that while there is an infinite samskaram at work here, based on a nested 
sequence of segments of arc of length $\delta/2^i$ in geometric progression, the segments 
are not taken to zero at any stage: there is nothing infinitesimal, no calculus. 
Perhaps this accounts for the unusual abruptness with which YB terminates the 
discussion of the interpolation. To follow this particular thread, therefore, we have 
to rely on sources other than YB. 

It is useful to start by recognising that the only point about making the expansion 
around the Aryabhatan angles $\theta = m\epsilon$ is that their sines and cosines are supposed 
known from the table; the geometry itself works for any value of $\theta$. Similarly, though 
$\delta$ is taken to be between $m\epsilon$ and $(m + 1)\epsilon$, the geometry works for any angle $\delta$. With 
this in mind we turn to Kim Plofker's careful analysis ([29, 30]) of the far more 
extensive account of this topic by Jyesthadevan's younger contemporary Sankaran 
in his commentaries on TS. As Plofker has underlined, the formulae are established 
for both the sine and the cosine simultaneously by setting up a pair of coupled 
equations for the correction: the output of alternating steps in the refining of the 
correction, say the $i$th for $\sin(\theta + \delta)$, involves $\cos(\theta + \delta/2^{i-1})$ whose evaluation 
involves $\sin(\theta + \delta/2^i)$ and so on. The expressions resulting from iterating the step 
ad infinitum (Sankaran does the first seven of them) are strongly reminiscent of the 
nested brackets occurring in the treatment of the geometric series by means of the 
elementary recursive identity but more spectacular ([30]): 


[Displayed equations: the nested-bracket expansions of $\sin(\theta + \delta)$ and $\cos(\theta + \delta)$ given in [30].] 
Removing the brackets turns them into infinite series in powers of $\delta$ (equations 
28 of [30], extended by repeated substitutions, in a slightly different notation), the 
numerical coefficients being, naturally, increasing inverse powers of 2. 

To our eyes these are strange series. The coefficients of successive powers of $\delta$ 
involve successive derivatives of the sine and the cosine linearly as in the Taylor 
series and the first two terms are exactly the same as in the Taylor series; but Taylor 
series they are not. We can begin to guess the precise source of the error (see the 
next-but-one paragraph) but that the series are wrong is made absolutely clear by 
the fact that they specialise at $\theta = 0$ to (equations 29 of [30]) 


$$\sin\delta = \delta - \frac{\delta^3}{4} + \cdots,$$

$$\cos\delta = 1 - \frac{\delta^2}{2} + \cdots$$


which are together inconsistent with the fact that sine and cosine are each other’s 
differentials up to a sign, something certainly known to Madhavan and much used 
in YB. And who could have failed to notice that they are inconsistent also with 
Madhavan’s own sine/cosine series? 

Not only is the space devoted in YB to the subject of interpolation dismissively 
short, there are also no statements in that short space to the effect that an exact 
expansion for $\sin(\theta + \delta)$ will result if the recursive construction is continued forever 
(no “atomisation”, no division by pararddham), statements of the sort found quite 
freely in connection with the $\pi$ series (sections 6.3 and 6.4 of [Sarma] cited earlier) 
and the sine/cosine series (e.g., sections 7.5.2, 7.5.4 and 7.5.5). Is one to conclude 
then that for Jyesthadevan, unlike perhaps for Sankaran, the formulae were no 
more than a reliable numerical approximation, a means of extending the utility of 
the sine table, with no deeper significance? If the answer is yes, that will be yet 
another piece of evidence for YB’s infallible instinct for what was genuinely deep 
and original in Madhavan’s accomplishments, but there may be a more prosaic 
explanation. The recursive element in this small section, in particular the idea 
of simultaneous samskaram, comes into its own when used in conjunction with 
Madhavan’s infinitesimal philosophy to generate the sine/cosine series. In that 
context it certainly gets its due attention, in the sections of YB that follow. 

The point to be underlined at the end is that doing a sequence of iterative 
corrections of the type seen above is not by itself calculus, any more than the 
geometric series obtained in the same way or the general binomial series is calculus, 
irrespective of whether the interpolation series coincide with the Taylor series or 
not and, even, whether they are correct or not. The essence of the Kerala approach to 
calculus was the same as it was to be in Europe and easily described in the language 
of its modern version: divide the range of the independent variable (the tangent to an 
arc in the arctangent series and the arc in the sine series, next subsection) into n equal
parts with n eventually to be made to tend to infinity, linearise the dependent variable
(the arc and the half-chord respectively in the two problems) as a function of the 
independent variable at each point of division and add up the linearised quantities 
ensuring that the remainders so omitted tend to zero in the limit, thereby implicitly 
establishing/employing the fundamental theorem. The interpolation formulae are 
outside that framework. The ‘independent variable’ δ is subjected to an iterated,
nested subdivision as in the case of the geometric series (hence the similarly nested
final expression) and as a result the segments of δ do not tend to zero in the limit
(they are not differentials); one only has an infinite sequence of segments whose 
limit is zero. There is a process of linearisation, but the ‘error’ involved in that 
process is not controllable in the limit. At the fundamental level, therefore, there 
is no conceptual linkage between the two. These shortcomings could be overcome, 
and were, by Madhavan himself in the course of developing the sine/cosine series 
around θ = 0, but then it becomes calculus. He could also have rewritten the series,
but did not, as expansions around any other point, true Taylor series [16], by noting 
that the length of an arc did not depend on its initial point being the direction east, 
that it was invariant under translations of the circle. 

It is in this light that the controversy that erupted a few years ago around the 
question of whether calculus was invented in India (the debate can be followed 
starting with [30] and the references therein) is to be viewed. The misinterpretation 
was to some extent inevitable: it owed its origin to a reading of the wrong material 
(the interpolation formulae) from the wrong sources (Sankara Variyar's elaborations 
on TS) while the right source remained, untranslated and largely unstudied, hidden 
in a language inaccessible except to a few. Thanks finally to the long overdue
translation of YB by K.V. Sarma, aided in no small measure by the pioneering work
of Tampuran and Ayyar, to all three of whom this article is a tribute, that reason is
no longer there.


4.4. From Differences to Differentials 


The development of the sine and cosine series 


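\[
\sin\theta = \theta - \frac{\theta^3}{3!} + \frac{\theta^5}{5!} - \cdots
\]

and

\[
\cos\theta = 1 - \frac{\theta^2}{2!} + \frac{\theta^4}{4!} - \cdots
\]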


borrows recursive elements both from the construction of the sine table (working
with sine differences) and from the interpolation formula (the method of simultaneous
samskaram). The critical input however is the novel concept already put to
use in the theory of the arctangent series, the infinitesimal geometry of Madhavan.
Though the conceptual and the technical go hand in hand, even more inseparably
in this part of YB than elsewhere, my emphasis will again be on the recursive
techniques employed; the originality of the calculus component of the arguments
will figure only incidentally, mostly in this subsection and in section 4.7. So the
description here of the geometry is minimal (greater detail can be found in [RSS]),
only to the extent required to make the working out of the steps comprehensible and
to highlight the specific steps that bear witness to the birth of calculus.

Thus we begin by considering a circle of unit radius and an arc of it subtending
an angle θ at the centre. The arc is divided into a number of equal segments and
the objects of study are the sines and cosines of the angles defined by the points
of division, geometrically the half-chords through these points and their distances
from the centre, as well as the corresponding quantities at the midpoints of the
segments. Nothing is lost and some extra effort in writing saved if we divide θ into
an even number 2n of segments of length δ := θ/2n from the start; so the quantities
of interest are s_i := sin iδ and c_i := cos iδ, i = 1, 2, ..., 2n.

The reason for considering the midpoints of the arc segments simultaneously with
their endpoints becomes apparent quickly. Applying trairasikam to two suitable pairs
of similar triangles, YB shows that the difference in the sines (cosines) of two angles
differing by two segments is proportional to the cosine (sine) of the intervening
angle:

\[
\delta s_i := s_{i+1} - s_{i-1} = 2 s_1 c_i, \qquad
\delta c_i := c_{i+1} - c_{i-1} = -2 s_1 s_i.
\]


These exact, elementary and beautiful relations are thus the finite difference
analogues of the differentials of the sine and the cosine. They are also true in the more
general form

\[
\sin(\phi + \psi) - \sin(\phi - \psi) = 2 \sin\psi \cos\phi
\]

for any φ and ψ as a quick glance at the geometry makes clear. Though no different
in method or conclusion from Nilakanthan's treatment of the tabulated sines (with
the arc segment fixed at π/48), their general validity for an arbitrary division is now
acknowledged and underlined for the first time: “Here the arc segment has to be
imagined to be as small as one wants . . . [but] since one has to explain in a certain
[definite] way, [I] have said [so far] that a quadrant has twenty-four chords” ([Sarma]
7.5.2). Other issues addressed by this simple construction/observation include the
quantification of the deviation from linearity (the inapplicability of the rule of three)
and the fact that while the difference formulae follow from the addition formulae,
the former can be established more directly and simply.

The second differences are defined in the same way: δ²s_i := δs_{i+1} − δs_{i−1}
and similarly for the cosines. Because the sine and cosine are each other's first
differences, the second differences can be calculated without any further geometry:

\[
\delta^2 s_i = 2 s_1\,\delta c_i = -4 s_1^2 s_i, \qquad
\delta^2 c_i = -2 s_1\,\delta s_i = -4 s_1^2 c_i.
\]

Consequently the content of the second difference equation is identical to that
of the pair of coupled first difference equations. YB deals with both sets, often in
conjunction, in its search for the solution. The slight lack of focus that results (the
treatment on the whole is less streamlined than that of the arctangent series)
leaves one with the impression that this first record of the detailed understanding
of what was after all a revolutionary advance was not completed without struggle.
In any case, the critical fact that these equations (especially the second difference
equation) which were originally derived for 2δ = π/48 (and for that value of δ
are the same as Nilakanthan’s starting point for the sine table) remain valid for any
value of δ gets its due attention as attested in the quotation above.
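
The first- and second-difference relations above can be spot-checked numerically for an arbitrary division. The following is only a minimal sketch in Python; the function name `max_violation` and the sample values of θ and n are illustrative assumptions, not anything found in YB. It tabulates s_i = sin iδ and c_i = cos iδ with δ = θ/2n and returns the largest deviation from the four relations.

```python
import math

def max_violation(theta, n):
    """Largest deviation from the difference relations for delta = theta / (2n)."""
    delta = theta / (2 * n)
    # One extra entry so that s[i + 1] exists for i = 2n.
    s = [math.sin(i * delta) for i in range(2 * n + 2)]
    c = [math.cos(i * delta) for i in range(2 * n + 2)]

    def ds(i):  # first difference of the sines, s_{i+1} - s_{i-1}
        return s[i + 1] - s[i - 1]

    def dc(i):  # first difference of the cosines, c_{i+1} - c_{i-1}
        return c[i + 1] - c[i - 1]

    worst = 0.0
    for i in range(1, 2 * n + 1):
        # delta s_i = 2 s_1 c_i  and  delta c_i = -2 s_1 s_i
        worst = max(worst, abs(ds(i) - 2 * s[1] * c[i]), abs(dc(i) + 2 * s[1] * s[i]))
    for i in range(2, 2 * n):
        # delta^2 s_i = -4 s_1^2 s_i  and  delta^2 c_i = -4 s_1^2 c_i
        worst = max(worst,
                    abs(ds(i + 1) - ds(i - 1) + 4 * s[1] ** 2 * s[i]),
                    abs(dc(i + 1) - dc(i - 1) + 4 * s[1] ** 2 * c[i]))
    return worst
```

Calling `max_violation(math.pi / 2, 24)` corresponds to the quadrant of twenty-four chords; any other θ and n should give a deviation at the level of floating-point rounding, which is the point of the quotation.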

At this point it is convenient to break with the order in which YB broaches the
questions that arise in finding the exact solutions in the limit and note the obvious
fact that the equations themselves are well-defined in the limit as the familiar
differential equations satisfied by the sine and the cosine. As has been said several
times already, YB's way is to first solve them recursively and then to take the limit
of the solution. That will be described in the following two subsections. For the
present, let us just note that in the limit of large n, the angular segment δ = θ/2n
tends to 0, the chord length 2s_1 = 2 sin δ tends to 2δ, iδ becomes some general angle
φ < θ and the difference equations become the differential equations

\[
\frac{ds(\phi)}{d\phi} = c(\phi), \qquad \frac{dc(\phi)}{d\phi} = -s(\phi)
\]

and

\[
\frac{d^2 f(\phi)}{d\phi^2} = -f(\phi)
\]

respectively for f = s or c, defining df(φ)/dφ := lim_{δ→0} (f(φ + δ) − f(φ − δ))/2δ.
These equations cannot be solved by simply integrating their right sides unlike in 
the case of the arctangent series where the problem was reduced geometrically 
to a quadrature, namely solving the first order differential equation dθ(t)/dt =
1/(1 + t²), and then further reduced to evaluating integrals of powers by means
of the recursively generated geometric series for the inhomogeneous term. The 
solution of the corresponding difference equations will face the same problem. 
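
The passage from the symmetric differences to the differential equations can be illustrated numerically. The sketch below, in Python, is an illustration under stated assumptions (the helper names `symmetric_quotient` and `max_deviation` and the sampling of the quadrant are hypothetical choices): it measures how far the quotient (f(φ + δ) − f(φ − δ))/2δ is from cos φ for f = sin and from −sin φ for f = cos.

```python
import math

def symmetric_quotient(f, phi, delta):
    """The quotient (f(phi + delta) - f(phi - delta)) / (2 delta)."""
    return (f(phi + delta) - f(phi - delta)) / (2 * delta)

def max_deviation(delta, samples=50):
    """Worst gap over the first quadrant between the quotients and cos, -sin."""
    worst = 0.0
    for k in range(samples + 1):
        phi = k * (math.pi / 2) / samples
        worst = max(worst,
                    abs(symmetric_quotient(math.sin, phi, delta) - math.cos(phi)),
                    abs(symmetric_quotient(math.cos, phi, delta) + math.sin(phi)))
    return worst
```

Since the quotient for the sine equals cos φ · (sin δ)/δ exactly, the gap shrinks roughly like δ²/6 as the segment is made smaller.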


4.5. Recursive ‘Integration’ of a Difference Equation


To repeat, the difference equations for the sine and the cosine are homogeneous
equations, involving the same unknown quantities on the right as are sought and so
no direct evaluation or quadrature is possible. It is the strategy devised to get around
this difficulty that makes the final result, the sine/cosine series, the most impressive
achievement of YB's calculus. Apart from the calculus itself, the strategy involves
several recursive novelties of which the most fundamental is a process of samskaram
which kicks off with the first approximation of replacing the unknown chords by
the corresponding (known) arcs (i.e., sin φ by φ) and repeatedly refining it.

As a preliminary to the samskaram, YB makes the elementary observation that,
in general terms, for a function f(x) and a division (x_0, x_1, ..., x_n) of the domain
of x,

\[
f(x_n) - f(x_0) = \delta f(x_n) + \delta f(x_{n-1}) + \cdots + \delta f(x_1),
\]

with δf(x_i) := f(x_i) − f(x_{i−1}): the whole is the sum of its parts (the twitsoip
principle? the fundamental theorem of pre-calculus?). For the sine function, this
allows us to write

\[
s_{2n} = \delta s_{2n-1} + \delta s_{2n-3} + \cdots + \delta s_1,
\]

taking account of our symmetric definition of δ and the initial value s_0 = 0. The
difference equation for s can now be used to rewrite this as

\[
s_{2n} = 2 s_1 (c_{2n-1} + c_{2n-3} + \cdots + c_1) = 2 s_1 (c_{2n-1} + \cdots + c_3) + s_2.
\]

But each c_i is also the sum of its parts,

\[
c_{2i-1} = \delta c_{2i-2} + \cdots + \delta c_2 + c_1,
\]

and each δc_i can be substituted by using the difference equation for c, resulting in

\[
s_{2n} = -4 s_1^2 \bigl(s_{2n-2} + 2 s_{2n-4} + \cdots + (n-1) s_2\bigr) + n s_2.
\]

An alternative way of writing this, useful later, is

\[
s_{2n} = -4 s_1^2 \bigl((s_{2n-2} + \cdots + s_2) + (s_{2n-4} + \cdots + s_2) + \cdots + s_2\bigr) + n s_2.
\]
The decomposition on the right into a series of series is designed to aid the use of a
systematic process of samskaram and their sum is a measure of the deviation of the
chord of the whole arc from the sum of the chords of the (eventually infinitesimal) arc
segments. The corresponding equation for c can be derived by the same procedure.
An alternative method is to start from the second difference equation and apply the
same twitsoip principle twice, to the second and first differences successively, to
get to the same representation as above of s_{2n} (and c_{2n+1}). It seems appropriate to
call it the samkalitam (a word used to mean both the finite sum and its limit as an
integral in YB) representation.
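
The samkalitam representation is an exact identity for every n, and that can be checked directly. The sketch below is written in Python with a hypothetical function name `samkalitam_rhs`; it evaluates the right-hand side, n s_2 minus 4 s_1² times the "series of series" of the even-indexed sines, using the true sines as input, so that it can be compared with sin θ.

```python
import math

def samkalitam_rhs(theta, n):
    """n*s_2 - 4*s_1^2 * [(s_{2n-2}+...+s_2) + (s_{2n-4}+...+s_2) + ... + s_2]."""
    delta = theta / (2 * n)
    s = [math.sin(i * delta) for i in range(2 * n + 1)]   # s[i] = sin(i * delta)
    a = 4 * s[1] ** 2
    # Inner sum: s_2 + s_4 + ... + s_{2m}; outer sum over m = 1, ..., n - 1.
    nested = sum(sum(s[2 * j] for j in range(1, m + 1)) for m in range(1, n))
    return n * s[2] - a * nested
```

For n = 1 the nested sum is empty and the function returns s_2 = sin θ; for larger n the nested sum carries the whole deviation of the chord from n times the first chord.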

What does this representation represent? Nothing more or less than the
finite-difference counterpart of the formal integration of the relevant differential equation.
For instance, in the limit δ → 0 of the representation

\[
s_{2n} = 2 s_1 \sum_{j=1}^{n} c_{2j-1},
\]

2s_1 tends to dφ and the sum becomes the integral:

\[
s(\theta) = \int^{\theta} c(\phi)\, d\phi.
\]

Similarly the representation of s_{2n} as a sum of s_{2i} becomes

\[
s(\theta) = -\int^{\theta} d\phi \int^{\phi} d\phi'\, s(\phi').
\]

In other words, modulo the imposition of initial conditions, what is in play here
is what becomes in the limit the technique of turning differential equations into
integral equations by an appeal, usually tacit, to the fundamental theorem.

The samkalitam representation continues to hold when n is replaced by any
i < n, since θ is an arbitrary angle. For all i > 1 it is a nontrivial expression of
the content of the difference equations (for i = 1 it is empty: s_2 = s_2). But it is
an auxiliary result: it does not solve the problem of finding sin θ as an explicit
function of θ. YB says: “Here [we have seen that] the difference between the arc
and the chord is caused (or expressed) by all the chords that lie below the chosen
arc (istacapam). As [we] do not know any of these chords, think of the chord as
the arc itself and do the samkalitam of the arc” ([Sarma] 7.5.2). So the first guess
is to replace s_{2k} by s_{2k}^{(0)} := 2δk for all k < i and to compute s_{2i} (in particular s_{2n})
from the samkalitam representation. But before describing the result and its further
refining, it seems best to define and fix the notation, once and for all, for certain
combinatorial coefficients in terms of which they are expressed. These coefficients
are recursively defined as

\[
S_k(n) := \sum_{j=1}^{n} S_{k-1}(j)
\]

with

\[
S_1(n) := \sum_{j=1}^{n} j.
\]

In particular S_k(1) = 1 for all k. They have simple algebraic expressions which
have an interesting history and to which we will pay some attention soon but, for
the present, it is enough to work with the definitions.
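
Working "with the definitions" is easy to make concrete. The following Python sketch implements the repeated sums exactly as defined; the name `S`, the use of memoisation, and the convention that the sum is empty (zero) for n ≤ 0 are choices made here for illustration.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def S(k, n):
    """Repeated sum: S_1(n) = 1 + 2 + ... + n and S_k(n) = S_{k-1}(1) + ... + S_{k-1}(n)."""
    if n <= 0:
        return 0                      # an empty sum
    if k == 1:
        return sum(range(1, n + 1))   # sum of the natural numbers up to n
    return sum(S(k - 1, j) for j in range(1, n + 1))
```

For example, `S(2, 3)` adds the first three triangular numbers 1, 3 and 6, and `S(k, 1)` is 1 for every k because each sum then consists of a single term.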

The samskaram is carried out as follows. In the first step set s_{2i} as s_{2i}^{(0)} = 2iδ for
all i < n but treat the factor 4s_1² =: a as a known parameter not subject to correction
since there is no equation to determine it. With this input, the first approximation
for s_{2n} is a sum-of-sums:

\[
s_{2n}^{(1)} = 2n\delta - 2a\delta\,\bigl[((n-1) + \cdots + 1) + ((n-2) + \cdots + 1) + \cdots + 1\bigr]
            = 2n\delta - 2a\delta\,\bigl(S_1(n-1) + \cdots + S_1(1)\bigr),
\]

and hence simplifies to

\[
s_{2n}^{(1)} = 2\delta n - 2\delta a S_2(n-1).
\]

In the next step, the input values s_{2i}^{(1)} for i = 2, ..., n − 1 (s_1² does not change
since there is no equation to determine it) are chosen as the output of updating the
preceding sines:

\[
s_{2i}^{(1)} = 2\delta\,\bigl(i - a S_2(i-1)\bigr).
\]

Hence the second approximation for s_{2n} is

\[
s_{2n}^{(2)} = 2n\delta - 2a\delta\,\bigl\{[((n-1) - aS_2(n-2)) + \cdots + (1 - aS_2(0))]
  + [((n-2) - aS_2(n-3)) + \cdots + (1 - aS_2(0))] + \cdots + [(1 - aS_2(0))]\bigr\}.
\]

Using successively the recursive definitions of S_3 and S_4, we get the final form of
the second samskaram as

\[
s_{2n}^{(2)} = 2n\delta - 2a\delta\, S_2(n-1) + 2a^2\delta\, S_4(n-2).
\]

The samkalitam representation being linear, it is straightforward to extend the
process indefinitely (no need to invoke analogies). The final result is an expansion
of s_{2n} in powers of a:

\[
s_{2n} = 2\delta n - 2\delta a S_2(n-1) + \cdots + 2\delta (-1)^k a^k S_{2k}(n-k) + \cdots.
\]


This is quite a remarkable formula. For any finite n it terminates at k = n − 1
since S_k(0) = 0 by definition for any k but that will not give the exact value of
s_{2n} because the error in the first guess of replacing chords by arcs will propagate
through the iterations. One can undo the error by choosing s_{2i} = i s_2 (i.e., i times
the first chord segment rather than the arc) as the first guess and carry out further
refinings as before, resulting in

\[
\sin\theta = \sin\frac{\theta}{n}\,\Bigl(n - 4\sin^2\frac{\theta}{2n}\,S_2(n-1)
  + 4^2\sin^4\frac{\theta}{2n}\,S_4(n-2) - \cdots\Bigr).
\]

For n = 1 we get the tautology sin θ = sin θ as expected, for n = 2 the Aryabhatan
relation sin θ = 2 sin(θ/2)(1 − 2 sin²(θ/4)), etc. In fact the formula is nothing but
the generalisation of Nilakanthan's exact solution for the sine table to arbitrary
divisions of an arbitrary angle.
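
The refinement procedure and the finite series it produces can be tried out numerically. What follows is a sketch in Python under stated assumptions: the names `S`, `refine` and `finite_series` are hypothetical, the list entry `s[i]` stands for the even-indexed sine s_{2i}, and `seed` is the value assigned to the first chord (2δ for the arc guess, sin 2δ for the chord guess). The parameter a = 4 s_1² is held fixed throughout, as in the text.

```python
import math
from functools import lru_cache

@lru_cache(maxsize=None)
def S(k, n):
    """Repeated sum: S_1(n) = 1 + ... + n, S_k(n) = S_{k-1}(1) + ... + S_{k-1}(n)."""
    if n <= 0:
        return 0
    if k == 1:
        return sum(range(1, n + 1))
    return sum(S(k - 1, j) for j in range(1, n + 1))

def refine(theta, n, seed, steps):
    """Run `steps` refinements of the samkalitam representation, starting from i * seed."""
    a = 4 * math.sin(theta / (2 * n)) ** 2     # a = 4 s_1^2, not subject to correction
    s = [i * seed for i in range(n + 1)]       # first guess for s_0, s_2, ..., s_{2n}
    for _ in range(steps):
        # New s_{2i} = i*seed - a * (series of series of the previous guesses below it).
        s = [i * seed - a * sum(sum(s[j] for j in range(1, m + 1)) for m in range(1, i))
             for i in range(n + 1)]
    return s[n]

def finite_series(theta, n):
    """sin(theta/n) * (n - a S_2(n-1) + a^2 S_4(n-2) - ...), ending at k = n - 1."""
    a = 4 * math.sin(theta / (2 * n)) ** 2
    terms = [n] + [(-a) ** k * S(2 * k, n - k) for k in range(1, n)]
    return math.sin(theta / n) * sum(terms)
```

With the chord as seed, n − 1 refinements reproduce sin θ to rounding accuracy and agree with `finite_series`; with the arc as seed the same number of steps gives sin θ multiplied by 2δ/sin 2δ, which is the propagated first-guess error mentioned above.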

YB does not quite get to describing the finite series. It has, once again, more
ambitious goals than a clever trigonometric identity. After setting out the samskaram,
it straightaway takes the limit of large n (for fixed θ) in all the trigonometric factors
occurring in the series (with the usual reference to pararddham and “endless
atomisation”): 2δ → θ/n, a → θ²/n², leaving the coefficients S_{2k}(n − k) untouched for
the moment. As n tends to infinity, the finite series turns into a power series whose
dominant terms give

\[
\sin\theta \approx \theta - \frac{\theta^3}{n^3}\,S_2(n-1) + \frac{\theta^5}{n^5}\,S_4(n-2) - \cdots.
\]

The asymptotic form of S_k(n) is actually easy to find and already available from
chapter 6 of YB: S_k(n) ~ n^{k+1}/(k + 1)!, leading finally to the sine series.
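
The asymptotic statement can be illustrated by computing the repeated sums for a large n. The Python sketch below makes its own choices (the names `repeated_sum` and `ratio` are hypothetical, and the sums are built by running totals rather than by any closed formula): it returns S_{2k}(n − k)·(2k + 1)!/n^{2k+1}, which should approach 1 if the coefficient of θ^{2k+1} tends to 1/(2k + 1)!.

```python
from math import factorial

def repeated_sum(k, n):
    """S_k(n) obtained by k rounds of running totals over 1, 2, ..., n."""
    if n <= 0:
        return 0
    row = list(range(1, n + 1))
    for _ in range(k):
        running, totals = 0, []
        for value in row:
            running += value
            totals.append(running)
        row = totals                  # row[j - 1] now holds the next-order sum at j
    return row[-1]

def ratio(k, n):
    """S_{2k}(n - k) * (2k + 1)! / n^(2k + 1); tends to 1 as n grows."""
    return repeated_sum(2 * k, n - k) * factorial(2 * k + 1) / n ** (2 * k + 1)
```

For n = 2000 the ratios for k = 1 and k = 2 already differ from 1 only in the seventh and sixth decimal places respectively.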

The derivation of the cosine series proceeds in parallel. Also, instead of solving 
the second-difference equation for the sine directly, we could have started from 
the two first-difference equations for the sine and the cosine, written down the 
samkalitam representations for them, made the first guesses sin θ = θ and cos θ = 1,
refined them in alternation and arrived at the same final series. 

The samskaram technique can be used even more readily after taking the limit,
to solve the resulting differential equations. As noted earlier, we can appeal to the
fundamental theorem to formally integrate the differential equations to the integral
form of the samkalitam representations; for instance the second order equation for
the sine becomes the integral equation

\[
\sin\theta = \theta - \int_0^{\theta} d\phi_1 \int_0^{\phi_1} d\phi_2\, \sin\phi_2,
\]

on supplying the limits and constants of integration and using the initial conditions
sin 0 = 0, cos 0 = 1. The samskaram way of solving it is to substitute the first
guess sin φ_2 = φ_2 in the integrand on the right to produce sin θ = θ − θ³/3!, then
substitute this expression back into the integrand and so on. Alternatively and even
more faithfully to the three samskaram steps listed in section 3.1, replace the sine
in the integrand by its integral representation repeatedly. The result after k steps is
the identity

\[
\sin\theta = \theta - \frac{\theta^3}{3!} + \cdots
  + (-1)^k \int_0^{\theta} d\phi_1 \int_0^{\phi_1} d\phi_2 \cdots \int_0^{\phi_{2k-1}} d\phi_{2k}\, \sin\phi_{2k},
\]

which, in structure, resembles the elementary recursive identity first encountered
in section 3.3 but by now has evolved into a more intricate samskaram equation.
We know that the last (error) term vanishes asymptotically but, unlike when dealing
with the arctangent series, YB does not talk about the convergence of the sine/cosine
series.
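
The repeated substitution into the integral equation can be carried out mechanically on polynomial coefficients. The sketch below is in Python with exact rational arithmetic; the function names and the representation of a polynomial as a list of coefficients, lowest power first, are assumptions made for illustration. Each refinement replaces the current guess p by x minus the double integral of p from 0.

```python
from fractions import Fraction

def integrate_twice(coeffs):
    """Coefficients of the double integral from 0 of the polynomial sum(c_m x^m)."""
    out = [Fraction(0), Fraction(0)]
    for m, c in enumerate(coeffs):
        out.append(Fraction(c) / ((m + 1) * (m + 2)))   # x^m -> x^(m+2) / ((m+1)(m+2))
    return out

def samskaram_sine(steps):
    """Coefficient list after `steps` refinements of the first guess sin(x) = x."""
    guess = [Fraction(0), Fraction(1)]
    for _ in range(steps):
        correction = integrate_twice(guess)
        guess = [Fraction(0)] * len(correction)
        guess[1] = Fraction(1)                          # the leading term x
        for m, c in enumerate(correction):
            guess[m] -= c                               # x minus the double integral
    return guess
```

Each step adds one more term of the series: one refinement gives the coefficients of x − x³/3!, two give x − x³/3! + x⁵/5!, and so on.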


4.6. An Inductive Proof That Is Absent 


In some contrast to its treatment of the coefficients in the arctangent series where
their inductive evaluation was done after their asymptotically dominant form was
determined, YB describes exact formulae for the coefficients S_k(n) for general k and
n. It starts by giving the traditional geometric yukti for sums of natural numbers
S_1(n) = n(n + 1)/2 in surprising detail and then goes on to state the rule for S_2(n)
and S_k(n) more generally: “Multiply together the padam (the whole arc, in my
notation n), padam plus one and padam plus two. The result of dividing it by the
product of one, two and three is the second samkalitam. In the same manner multiply
the numbers which are increased by one at a time and divide it by the product of the
same number [of factors] one, two, etc. The result is the samkalitam which is one
below” ([Sarma] 7.5.3):

\[
S_k(n) = \frac{n(n+1)\cdots(n+k)}{(k+1)!}.
\]

Why does YB take the trouble of stating the exact formula for S_k(n) when all it
is interested in is its asymptotic form S_k(n) ~ n^{k+1}/(k + 1)! which was already
obtained and stated in chapter 6? The reason, almost certainly, was the fact that it was
known. According to [RSS], the general formula appears in a work of Narayana of
the mid-14th century, which probably makes it its earliest occurrence. Slightly later
than YB, the formula is also set out in Sankaran’s Kriyakramakari, a commentary
not on TS but on Bhaskara's Lilavati. But none of these sources contains a yukti for
the formula and that includes YB. This is especially surprising in the case of YB as
an inductive proof, very much in line with what it has already accomplished and not
significantly more challenging, can be readily constructed. A modern formulation
of the proof will go as follows.


48 To my knowledge, none of the commentators on YB have supplied a proof either. [TA] verifies
the formula for k up to 3, by resorting to the known sums of low powers of i which obviously could
not be generalised since sums of general powers were not known. I got the inductive proof below from
Chandrashekhar Cowsik to whom I express my thanks.

Let

\[
T_k(n) := \frac{n(n+1)\cdots(n+k)}{(k+1)!}.
\]

Then

\[
T_k(n) - T_k(n-1) = \frac{n(n+1)\cdots(n-1+k)}{(k+1)!}\,\bigl((n+k) - (n-1)\bigr)
                  = \frac{n(n+1)\cdots(n+k-1)}{k!} = T_{k-1}(n).
\]

But from the recursive definition of S_k(n) we have directly S_k(n) − S_k(n − 1) =
S_{k−1}(n). So the result S_k(n) = T_k(n) will follow for given k and n if S_{k−1}(n) =
T_{k−1}(n) and S_k(n − 1) = T_k(n − 1). It is sufficient therefore to show that
S_k(1) = T_k(1) for all k and S_1(n) = T_1(n) for all n. Now i) S_k(1) consists of
the single term S_{k−1}(1) and so on: S_k(1) = ··· = S_1(1) = 1 = T_k(1); and ii)
by definition, T_1(n) = n(n + 1)/2 = S_1(n) (as YB has taken the trouble to show
geometrically). End of proof.

Even if writing out this proof serves no other purpose, it will have convinced the
reader that it certainly was not beyond the mathematical (and literary) powers of
YB: it is quite as easy to turn it around and present it in YB’s preferred style, starting
with k = n = 1 and building it up one step at a time in k and n.

In seeking to identify the reason(s) for the general silence on the yukti, we may
discount the notion that there was in fact no yukti and that the general formula was
an extrapolation from small values of k = 1, 2 (known since Aryabhata, Ganita 21
of AB); the Kerala mathematicians most particularly were not in the habit of stating
results lacking sound logical support. Sankaran's enunciation of the general formula
for S_k(n), more precise than YB’s, is actually followed by the line “The yukti of this
is not easy to follow (not sugama) and so is not described here”. (The whole verse
will be found in [6] p.248). And it is preceded by a geometrical demonstration of the
cases k = 1, 2 involving a clever cutting up of the area and volume of suitable two
(the same as in YB) and three dimensional figures. As Sarasvati Amma [6] wonders,
how can such a demonstration be extended to k > 2 “unless one were to conceive
spaces with more than three dimensions”, especially in a work professedly meant
for “the benefit of the not so intelligent” (alpadhiyam hita)?

It is tempting to suggest that what the slow-witted really found difficult was
the transition from a long-familiar geometrical and arithmetical mode of reasoning
to one based on combinatorial algebra and, even more disorientingly, on novel
logical uses of recursive thinking that an inductive proof represented. That would
presuppose that such proofs were a genuine breakthrough, born of a conscious
recognition that geometric methods had hit their natural limitations and that new
yukti-techniques were called for. The extreme care with which the first recorded (as
far as we know now) inductive yukti, for the terms in the π series, is constructed
and presented in YB is then fully justified: the sums of powers greater than two
that occur there are as resistant to geometry as the coefficients S_k(n). And if the
only yukti available for the computation of the latter was in fact inductive, we will
have to conclude that Narayana who lived just before Madhavan, we are not certain
where, was an adept of such techniques.
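
The two ingredients of the inductive argument given earlier in this subsection, the closed form n(n + 1)···(n + k)/(k + 1)! and the difference relation it satisfies, can be checked for small k and n. The Python sketch below is only a finite verification, not a proof; the names `S` and `T` follow the notation of the text, while the ranges tested are arbitrary choices.

```python
from functools import lru_cache
from math import factorial, prod

@lru_cache(maxsize=None)
def S(k, n):
    """Repeated sum defined recursively: S_1(n) = 1 + ... + n, S_k(n) = sum of S_{k-1}(j)."""
    if n <= 0:
        return 0
    if k == 1:
        return sum(range(1, n + 1))
    return sum(S(k - 1, j) for j in range(1, n + 1))

def T(k, n):
    """Closed form n (n + 1) ... (n + k) / (k + 1)!, an exact integer division."""
    return prod(range(n, n + k + 1)) // factorial(k + 1)
```

A loop over, say, k up to 5 and n up to 8 confirms `S(k, n) == T(k, n)` and `T(k, n) - T(k, n - 1) == T(k - 1, n)`, the step on which the induction turns.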

All this only deepens the mystery of the immediate antecedents of Madhavan's 
achievements, he himself having left too little of his own to throw any light on it. 
We are all the more fortunate then that so many of his followers turned out to be 
such prolific authors. 


5. Two Footnotes
5.1. Decimal Numbers and Polynomials 


One of the most interesting sections of YB not directly tied to calculus is concerned
with the development of a notation and a proper set of rules for dealing with algebraic
operations on functions involving powers of one variable. The context in which the
need for such a general algebraic machinery emerges is that of the estimation of
the corrections to be applied when the π series is truncated. As described in section
3.5, the remainder r_j after i = (j + 1)/2 (j a positive odd integer) terms satisfies
a recursive equation in j and r_j is to be chosen, effectively by trial and error, so as
to satisfy this equation as closely as possible. It is in this connection that YB makes
its only explicit allusion to the notion of a function, also as noted in section 3.5.⁴⁹
The truncation point not being fixed in advance, j is treated as an avyaktarasi, “the
unknown”, taking values in positive odd integers. YB says ([Sarma] 6.8, p.74):


Then [one] has to know (or learn) how to do computations (the expression used 
applies to a traditional method of calculating with tokens placed in compartments 
drawn on the floor, on a piece of cloth, etc.) with quantities without a [fixed] 
numerical value. When [such an] unknown is assigned a numerical value, a number 
[which was] in a [given] place occupies the next higher place [when multiplied] by 
that number. Thus are the place differences to be thought of, not tenfold as by one, 
ten, hundred, etc. ...


49 This is the closest YB comes to working from a genuinely abstract algebraic standpoint, and it
marks a clear advance over Bhaskara's Bijaganita. Those who will wish one day to enquire into the
direction Indian mathematics might have taken after Kerala, had historical circumstances been different,
will find this part of YB a fascinating piece of text to parse and to ponder over.

So a natural number n in the first place stands for n itself, in the second place
for n × j and so on. When the coefficients are fractions, they can be reduced to
a common denominator, i.e., an overall numerical divisor, and for negative
coefficients, YB cites Brahmagupta's rules for multiplication among numbers “of a
positive or a negative nature” (dhanabhutam or rnabhutam). What results is a notation
that characterises a polynomial function of one variable (over the rationals) by
the numbers in the compartments, i.e., by the ordered sequence of its coefficients
which, up to a common denominator, can be a positive or negative integer, thus
accounting for YB's sudden interest in Brahmagupta's old rule for multiplication with
negatives.⁵⁰

The functions that turn up in the samskaram work are not polynomials but involve
(small) powers of j and 1/j. The error after truncation at j:

\[
e(j) := \frac{1}{r_{j-2}} + \frac{1}{r_j} - \frac{1}{j}
\]

for the trial choice r_{j−2} = 2j − 2, r_j = 2j + 2 is an example. Such functions
are also to be reduced to a common denominator by extending the usual numerical
method, which involves addition and multiplication, to polynomials. The result
will be a rational function of which the numerator and denominator are treated
independently. The algebraic needs of the operations on the functions occurring in
all the variants of the correction techniques met with in section 3.5 are thus fully
met if the operations of addition and multiplication are defined for polynomials. YB
proceeds to explain how that is to be done and understood with the help of relevant
examples. I quote the first part of that passage ([Sarma] 6.8, p.75).


Here [I] show how the work is to be done when the unknown (rasi, the reader
will have noted that this word is used in two different senses in this section) is
taken to be the last odd number (the truncation point j). Make (literally, “write” or
“draw”) two rows of compartments (khandam) so that each compartment encloses
one place. In this the upper row is for the numerators and the lower row for the
denominators. The odd number (the unknown j) is |1|0| (I have used vertical bars
to separate the compartments). If the rasi is negative (since j is always positive,
what is meant here are the coefficients of powers of j) it must be distinguished by
some mark. For zero, whatever thing is used for it.


The divisor in the first correction term (1/(2j − 2)) is two less than twice the
unknown. For it, two in the second place and negative two in the first place: |2|2»|
(where » marks the negatives). Then the divisor in the second correction term is
two more than twice the unknown. For that, in the second place, twice the rasi, that
is two and, in the first place, positive two: |2|2|.


50 It is ironic that on the unique occasion on which YB resorts to a symbolic representation of a
mathematical object, the resulting notation is more compact than the modern way of writing out a
polynomial.




346 P. P. Divakaran 


The passage continues with a description of the multiplication and division of 
a polynomial by a number (obvious) and the multiplication of two polynomials 
(relying on the fact that j is actually a number though unknown). There are no 
surprises here; the time and care spent on the explanation are perhaps, once again, 
warranted by the conceptual innovation. 
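
The compartment notation amounts to storing a polynomial as its ordered sequence of coefficients and operating on the places. The sketch below, in Python, is a modern illustration under stated assumptions: coefficients are stored lowest place first (the reverse of the written row), the helper names are hypothetical, and the combination 1/(2j − 2) + 1/(2j + 2) − 1/j of the two trial correction terms with the last term of the series, including its sign convention, is assumed here for the example. Numerator and denominator are kept as separate polynomials, as the two rows of compartments are.

```python
def poly_add(p, q):
    """Add two coefficient lists place by place (lowest place first)."""
    size = max(len(p), len(q))
    return [(p[i] if i < len(p) else 0) + (q[i] if i < len(q) else 0) for i in range(size)]

def poly_mul(p, q):
    """Multiply as in place-value arithmetic, but with no carrying between places."""
    out = [0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for k, b in enumerate(q):
            out[i + k] += a * b
    return out

def poly_eval(p, j):
    """Assign a numerical value to the unknown."""
    return sum(c * j ** i for i, c in enumerate(p))

first = [-2, 2]     # 2j - 2, written |2|2>| with the negative marked
second = [2, 2]     # 2j + 2, written |2|2|
unknown = [0, 1]    # j itself, written |1|0|

# Common denominator (2j - 2)(2j + 2) j and the matching numerator.
denominator = poly_mul(poly_mul(first, second), unknown)
numerator = poly_add(
    poly_add(poly_mul(second, unknown), poly_mul(first, unknown)),
    [-c for c in poly_mul(first, second)],
)
```

Here `numerator` reduces to the constant 4 and `denominator` to 4j³ − 4j, so that for any assigned value of j the combination equals 1/(j³ − j).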

At this early stage of our appreciation of the riches to be mined from a detailed 
exploration of YB, there is nothing much to add except perhaps to stress again 
how closely the notation for and the algebraic operations with polynomials are 
modelled on the writing of numbers in the decimal place value notation and
arithmetical operations with them. A polynomial is uniquely characterised by the 
coefficients of powers of the unknown. The unknown (variable) is left implicit just 
as the base 10 is in the decimal notation; it is not assigned a symbol but has a 
name, (avyakta)rasi. The only difference is that the coefficients are from the ring 
of integers and negative coefficients have to be explicitly distinguished whereas in 
the decimal representation of numbers, each coefficient is a nonnegative integer 
modulo 10 — there are no carry over rules in the algebra of polynomials. We may 
nevertheless feel confident in the guess [16] that, for the Kerala mathematicians, 
the decimal numbers served as a bridge not only to the old idea of the infinite, but 
also to the acceptance of infinite series both numerical and functional, just as they 
did for Newton (section 1.4). 


5.2. The Roots of Samskaram, Roots by Samskaram 


How far back in antiquity can we trace recursive computational techniques of 
the sort we have come across in YB, more particularly in their samskaram form? 
There are very credible indications that the idea of making an initial incorrect 
guess at the solution of a problem which is then sequentially refined to any desired 
degree of accuracy is very old indeed, as old as the earliest recorded mathematics 
in India, the Sulbasutra manuals (8th-7th C. BCE) [31]. Three of them, those of
Baudhayana, Apastamba and Katyayana, have an identically expressed rational 
numerical approximation for the square root of 2 which is most easily and logically 
obtained (the specific arithmetical formula given in the texts is a good guide in this) 
by an exercise in the method of samskaram. The technique also seems to be at the 
back of a rational algebraic formula for the approximate square root of a general 
nonsquare positive integer n found in the much later Bakhshali manuscript. It seems 
best to discuss these together, but in anti-chronological order, beginning with the 
general case. 

The period of the Bakhshali manuscript has been a contentious issue, various
scholars suggesting various dates ranging between the 1st and the 13th centuries CE.
According to Takao Hayashi whose critical edition [15] contains the most thorough
study we have of the work, it was most likely composed in the 8th century. (That
makes it substantially more than a thousand years younger than the Sulba manuals).
Our interest is in one sūtra (sūtra Q2 in Hayashi's nomenclature) which is cited on
more than one occasion in the surviving fragment of the text and which contains the
formula mentioned above. The literal meaning of the verse is not unambiguous but
there is unanimity on its mathematical content (see Hayashi [15]):


r l r X2 l 
X I n=m+>—-—-~(—) 
vin 2m 2\2m/ m+ 2d 


where $m^2$ is the square number nearest to and less than $n$ and $r$ is the difference
$n - m^2$. Bakhshali of course gives no proof or explanation but it is difficult to look
at this formula without immediately recognising its samskaram credentials. In that
spirit, the obvious first guess is $x_1 := \sqrt{n_1} = m$ so that $r$ is a measure of the error.
It is easy to re-express $n$ as the difference of two squares:


$$n = \left(m + \frac{r}{2m}\right)^{2} - \left(\frac{r}{2m}\right)^{2},$$

a type of reordering very popular in Indian mathematics. The next approximation
$x_2$ is obtained by neglecting the second (and second order) term in the above exact
formula:

$$x_2 := \sqrt{n_2} = m + \frac{r}{2m}.$$


Now define $r'$ to be the difference between the exact value and the approximant $x_2$:

$$r' = (m^2 + r)^{1/2} - m - \frac{r}{2m},$$

which equation, on squaring and neglecting $r'^2$ but not $(r/2m)^2$, leads to

$$r' = -\left(\frac{r}{2m}\right)^{2}\frac{1}{2\left(m + \frac{r}{2m}\right)}$$


and hence to the Bakhshali formula as $x_3$ in this sequence of approximations. The
question of course is, is this how the author of sūtra Q2 arrived at the formula?
Since it is not obviously a result that should be “self-evident to the intelligent”
(Nilakanthan, alluding to the theorem of the diagonal), there presumably was a yukti
and, going by the choice of the ‘variables’ (the nearest perfect square and its deviation
from the required answer), we can at least tentatively assume that it was a recursive
one, not very different from the one above. Indeed just such a justification has
been suggested by commentators, the closest to the method of samskaram (though
without drawing the parallel) being that of Channabasappa [32].
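
The conjectured yukti can be checked mechanically. The following is a minimal sketch in exact rational arithmetic, not a reconstruction of anything in the manuscript: the function names refine and bakhshali_sqrt and the sample value n = 41 are illustrative assumptions. It shows that two refining steps starting from the guess m reproduce the three-term formula of sūtra Q2.

```python
from fractions import Fraction
from math import isqrt


def refine(x, n):
    """One refining step: add the error n - x**2 divided by 2x."""
    return x + (n - x * x) / (2 * x)


def bakhshali_sqrt(n):
    """Three-term formula of sutra Q2 for a nonsquare positive integer n."""
    m = isqrt(n)                    # m**2 is the nearest square below n
    r = n - m * m                   # deviation r = n - m**2
    half_step = Fraction(r, 2 * m)  # r/(2m)
    return m + half_step - half_step ** 2 / (2 * (m + half_step))


n = 41                              # arbitrary nonsquare example (assumption)
x1 = Fraction(isqrt(n))             # first guess m
x2 = refine(x1, n)                  # m + r/(2m)
x3 = refine(x2, n)                  # second refinement
print(x2, x3, x3 == bakhshali_sqrt(n))
```

Working with Fraction rather than floating point keeps every approximant a rational number, which is the form in which the texts state their results.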

The iteration can be carried forward indefinitely by consistently neglecting the
quadratic term in the error at each stage, again in the general spirit of recursive
refining. The result is easily expressible in an explicit infinitely recursive formula;
a particularly neat and compact formulation is given in Hayashi [15], formula
(11.1).⁵¹ What is interesting now is that we can also evaluate $\sqrt{n}$ to arbitrary
accuracy by expanding $\sqrt{n} = ((m + r/(2m))^2 - (r/(2m))^2)^{1/2}$ in powers of the
second term in a (convergent) binomial series. The two series are not the same — 
successive terms have, in general, different structures for the denominators — and 
there is no reason why they should be; they are expansions in different variables. 
Nevertheless, the first three terms of the binomial series coincide exactly with the 
three terms in the Bakhshali formula. The first moral to be drawn from this is 
that the absence of a general binomial theorem was not much of a handicap in 
finding alternative (convergent in this case) infinite expansions, as noted earlier (the 
end of section 3.3). The second comes from the similarity and dissimilarity with 
Madhavan's interpolation formula (section 4.3) which also agrees with the Taylor 
series in the first two terms: whether a given formal iterative expansion is a valid 
one is to be independently checked in each problem. 
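
A small numerical illustration of the point that the two expansions part company after the third term. This is a sketch under stated assumptions: the function name fourth_terms and the symbols a and b (for m + r/(2m) and r/(2m)) are introduced here for convenience and are not in the source. The binomial term is the next term of a(1 - b²/a²)^(1/2), namely -b⁴/(8a³); the recursive term is the next refining correction applied to the three-term value.

```python
from fractions import Fraction


def fourth_terms(m, r):
    """Return (binomial, recursive) fourth terms for sqrt(m**2 + r)."""
    n = m * m + r
    b = Fraction(r, 2 * m)               # r/(2m)
    a = m + b                            # so that n = a**2 - b**2 exactly
    x3 = a - b * b / (2 * a)             # three-term value, common to both
    binomial = -b ** 4 / (8 * a ** 3)    # next term of the binomial series
    recursive = (n - x3 * x3) / (2 * x3) # next refining correction
    return binomial, recursive


print(fourth_terms(1, 1))
```

For m = 1, r = 1 the two fourth terms are -1/432 and -1/408: the same numerator structure but different denominators, as the text describes.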

YB does not refer to the Bakhshali formula; it has no occasion to. But given the 
uncanny similarity of the only yukti we know for it to the general philosophy of 
samskaram, we are obliged to make doubly sure that this affinity is not something
we have imposed on the actual, bare-bones, formula by overinterpreting it. In other 
words, how can we be sure that it was thought of as the starting point of an exact, 
endlessly recursive, procedure as the paragraphs above (and Hayashi's formula 
(11.1)) suggest, and not just as a clever and empirically reliable approximation 
without a general theoretical basis? It is here that we can turn for help to the
numerical Sulbasutra formula for $\sqrt{2}$. That formula is (literally reading the verses
for the various numbers involved)

$$\sqrt{2} = 1 + \frac{1}{3} + \frac{1}{3 \times 4} - \frac{1}{3 \times 4 \times 34}.$$


On the other hand we can apply the Bakhshali formula to $\sqrt{2}$ ($m = 1$, $r = 1$) to get

$$\sqrt{2} = 1 + \frac{1}{2} - \frac{1}{12} = \frac{17}{12},$$

which is numerically equal to the sum of the first three terms of the Sulbasutra
formula. Even more strikingly, the next samskaram in our conjectured general yukti
for Bakhshali leads to a correction term which is $-1/408$, again identical with the
4th Sulbasutra term (discounting the particular factorisation employed).
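
The numerical coincidence is easy to confirm in exact arithmetic. The sketch below is only a check of the numbers quoted above; the names sulba_terms and refining_sequence are hypothetical, and the refining rule used (add the error divided by twice the current value) is the conjectured yukti, not something stated in either source.

```python
from fractions import Fraction

# The four terms of the Sulbasutra value, as read off the verses.
sulba_terms = [
    Fraction(1),
    Fraction(1, 3),
    Fraction(1, 3 * 4),
    -Fraction(1, 3 * 4 * 34),
]


def refining_sequence(n, first_guess, steps):
    """Successive approximants to sqrt(n), each refined from the last."""
    xs = [Fraction(first_guess)]
    for _ in range(steps):
        x = xs[-1]
        xs.append(x + (n - x * x) / (2 * x))
    return xs


from_one = refining_sequence(2, 1, 3)   # first guess m = 1
print(from_one)
print(sum(sulba_terms[:3]), sum(sulba_terms))
```

The third approximant agrees with the sum of the first three Sulbasutra terms, and the fourth with the full four-term value.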

The book of Sen and Bag describes several suggested reconstructions of the
Sulbasutra formula for $\sqrt{2}$ without mentioning the general Bakhshali formula.
None of them agrees in every step with our conjectured Bakhshali yukti. But one
due to L. Rodet, published in 1879, two years before the Bakhshali manuscript
was even unearthed, comes very close. It starts with the first guess $\sqrt{2} = 1 + 1/3$
(rather than just 1) and in two steps of numerical samskaram gets to the Sulbasutra
approximation. It is obvious that Rodet chose the first two terms to reproduce
the 1/3 term and it is equally obvious that the dharma of samskaram imposes
no conditions on the first guess except those of practical utility (economy) and
theoretical soundness (convergence, for instance). In any case, despite the numerical
coincidence seen above, there is no way to be absolutely sure that the putative
yukti(s) of our two sources were the same. The numbers which appear as the various
terms and factors may or may not point to a particular method of arriving at the
result; it is not impossible for example (though it appears unlikely) that the choice
of the denominators 3, 3 × 4 and 3 × 4 × 34, identically expressed in all three
Sulba texts dealing with the question, was dictated by the demands of metrical
composition.

51 Just to avoid any misunderstanding, such a general formula does not occur anywhere in Bakhshali.

It is also something of a surprise that it is the earlier text that carries the refining 
one step further than the later. Here again, it may well be that the author of sūtra 
Q2, having got an accurate enough three-term formula for reasonably small r/(2m), 
was daunted by the task of expressing in verse the fourth term, a fairly complicated 
rational expression involving two variables (m and r) and several numbers, rather 
than by mathematical difficulty. Not every mathematical versifier is a Sankara 
Variyar. 

With reasonable confidence, we may conclude from all this that the idea of 
recursion as captured in the method of successive refining has been a constant of 
Indian mathematical thinking and technique over the two millennia separating YB 
from the late Vedic era. The earliest Sulbasutras were probably assembled some time
after the compilation of the particular version of the Yajurveda containing the famous 
list of powers of 10 and the knowledge they display may well be contemporaneous 
with, or even anterior to, that text. As for Bakhshali, that comes, very likely, soon 
after those number-happy centuries when naming and listing impressively large 
numbers had become an end in itself. (A case can be made for its date to be
somewhat earlier than Hayashi’s, on the basis of what is reasonably certain about 
the political history of northwest India). The intervening period is surely one in 
which Indian thinkers, in addition to mastering arithmetic, lost their fear of infinity 
as the philosophical and canonical (including the astronomical) literature of Jainism, 
Buddhism and the Hinduism of the time demonstrates. From the recursive infinitude 
of numbers-with-a-base, it is a natural step to power series, a step taken also by 
Newton in his turn and, more generally, to the idea that a recursive computation 
need not end for it to make sense, that a well-defined quantity such as a number or 
the length of an arc can be expressed meaningfully in an unending series of terms. 
And the liberation probably came early — it is notable that all three of the Sulba 
manuals that give the approximate value of $\sqrt{2}$ end their enunciation of the formula
with the phrase savisesah, “with remainder”.

Thus, born out of the systematics of the decimal number nomenclature and
marking its first success as a clever scheme of approximation in arithmetical prob- 
lems, recursive thinking acquired such depth and richness with the evolution of 
mathematical ideas that, by the time of Madhavan, it would seem that mathematics 
in every one of its aspects, conceptual, computational and logical, was almost in- 
conceivable except in fundamentally recursive terms. Among the milestones which 
mark this journey three are particularly significant: the decimal numbers and the 
development of arithmetic of course, Aryabhata’s trigonometry and the sine table, 
and the invention and elaboration of calculus. Indeed a reading of Yuktibhasa makes 
one wonder whether Madhavan’s calculus would at all have been possible without 
the power and subtlety that these methods had by then acquired. 

Planetary and Lunar Models in
Tantrasangraha (c.1500) and
Ganita-Yukti-bhasa (c.1530)


M. S. Sriram 


1. Introduction 


Aryabhatiya (c.499) is the first extant text on mathematical astronomy in India to
discuss explicit algorithms for the computation of planetary positions [1].
Eccentric/epicyclic models are somewhat implicit in these algorithms in Aryabhatiya,
and in the texts which followed. In contrast, the planetary theory in Ptolemy’s
Almagest [2] and the later Islamic works which were influenced by Ptolemy, as also
Copernicus’s De Revolutionibus [3], are based on explicit geometrical models.
In essence, the motion of any planet is viewed as a combination of uniform circular
motions in these models. No such rigid principles are insisted upon in Aryabhatiya
or even in the later Indian texts. Though most texts do not discuss the geometrical
models at all, there are exceptions. Nilakantha Somayaji, who introduced a major
revision of the traditional Indian planetary theory in his Tantrasangraha [4,5,6],
outlined a geometrical model of planetary motion in his other works, Golasara,
Siddhantadarpana and Aryabhatiya-bhasya. In this model, planets move in eccentric
orbits inclined to the ecliptic around the mean Sun, which in turn goes around the
earth [7,8]. We have a detailed account of this model in Ganita-Yukti-bhasa,
popularly known as Yukti-bhasa (c.1530), authored by Jyesthadeva [9]. From here onwards,
[9] will be indicated by just Ganita-Yukti-bhasa in all the specific references to the
passages in it.

Apart from discussing the geometrical picture of planetary motion, Yukti-bhasa
also presents an interesting geometrical model of lunar motion while discussing
the so-called ‘dvitiya-sphuta’ or the ‘second correction’. This correction, termed
‘Evection’, was first introduced by Ptolemy in his Almagest, where he proposed a
geometrical model for the Moon’s longitude, which incorporates evection, apart from
the correction due to the eccentricity of the orbit. Ptolemy’s lunar model had a glaring
defect as the ratio of the maximum and minimum apparent angular diameters of the
Moon would be around 1.92 in this model, which is patently absurd. This deficiency
of Ptolemy’s lunar theory had been rectified in some later Islamic works which
otherwise follow the Ptolemaic framework, as for instance in Ibn al-Shatir’s (1304-1375/6
CE) lunar theory, which seems to have been adopted by Copernicus also, except for
small changes in parameters [10]. In the Indian tradition, the second correction to
the Moon makes its first appearance in Laghumanasa of Mañjulācārya (c.932),
who essentially gave an analytical form for it¹ [11]. Tantrasangraha also gives the
algorithm for the second correction which has the same form as in Laghumanasa,
but goes ahead to give the formula for the true distance of the Moon, incorporating
this. We have an explicit geometrical model for the same in Yukti-bhasa. This model
does not show any problem of unreasonable variation in the earth-Moon distance
that is associated with Ptolemy’s lunar theory. The corrections to the Moon’s longitude
as given in the lunar theories of both Tantrasangraha/Yukti-bhasa and Ibn al-Shatir
compare reasonably with the modern expression for the two corrections, though the
geometrical models are very different. Also, the patterns of variation of the lunar
distance are different in the two models.

In section 2, we discuss the salient features of planetary computations in Indian 
siddhantas beginning with Aryabhatiya. In section 3, we provide a brief outline of the
contents of Tantrasangraha and Yukti-bhasa. The planetary model in Yukti-bhasa is
described in detail in section 4. For the sake of comparison, the geocentric position
of a planet in Kepler's model is discussed in Appendix 1. In section 5, we discuss the
lunar theory of Yukti-bhasa and briefly compare it with the lunar theories of Ptolemy
and Ibn al-Shatir which are outlined in Appendices 2 and 3 respectively. 


2. Planetary Computations in Indian Siddhantas 


The Siddhanta period in Indian astronomy [12] begins with Aryabhatiya which
was composed in AD 499. There were indeed earlier siddhantas which have been
quoted by the later astronomers but are not available today. Mathematics
developed with astronomy in India. Siddhantas contain correct mathematical solutions
to astronomical problems, namely calculation of positions of planets (Sun, Moon,
Mercury, Venus, Mars, Jupiter and Saturn), and diurnal problems involving
spherical astronomy, and eclipses. The diurnal problems include finding north-south
directions, latitude of a place, Sun's diurnal path, its declination, determination of
the times of sunrise/sunset, measurement of time (from shadow), relations among
various celestial coordinates, calculation of lagna (point on the ecliptic which is on
the horizon at any time), etc. The treatment of eclipses includes the calculation of the
instant of conjunction, the two half-durations (from the first contact to the middle,
and from the middle to the last contact), duration of totality and the magnitude of
the eclipse, and so on.

1 Yallayya in his commentary on Laghumanasa mentions that this correction appears earlier in
Vatesvarasiddhanta (c.904) of Vatesvara. However, the currently available version of the latter work
does not include the second lunar correction.

Table 1: Planetary revolutions in a Mahayuga as given in Aryabhatiya, and the
inferred sidereal periods. * The revolution numbers of Mercury and Venus refer to
their sighroccas (heliocentric values).

Planet          No. of revolutions    Period of revolution in     Modern value
                in a Mahayuga         days (sidereal period)
------------------------------------------------------------------------------
Sun             43,20,000             365.25868                   365.25636
Moon            5,77,53,336           27.32167                    27.32166
Moon's apogee   4,88,219              3231.98708                  3232.37543
Moon's nodes    2,32,226              6794.74951                  6793.39108
Mercury*        1,79,37,020           87.96988                    87.96930
Venus*          70,22,388             224.69814                   224.70080
Mars            22,96,824             686.99974                   686.97970
Jupiter         3,64,224              4332.27217                  4332.58870
Saturn          1,46,564              10766.06465                 10759.20100

Aryabhatiya is the earliest available siddhanta text in India, which contains 
a systematic account of most of the traditional astronomical problems [1]. It is 
mentioned in the text itself that it was composed 3600 years after the beginning
of Kaliyuga, corresponding to AD 499, when the author was 23
years of age. Aryabhata composed this work in Kusumapura, which is the same
as Pataliputra (near modern Patna). From Aryabhatiya onwards, one considers a 
Mahayuga of 43,20,000 years. The number of revolutions in the stellar background 
made by the planets in the Mahayuga are given in all the texts (see Table 1). 

In the case of Mercury and Venus, the texts give the revolution numbers of 
their Sighrocca-s, which are nothing but their heliocentric revolution numbers. The 
number of civil days in a Mahayuga known as the Yugasavanadina is also specified. 
In Aryabhatiya, its value is given as 1,57,79,17,500.
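
The sidereal periods listed in Table 1 follow from these numbers by a single division: the number of civil days in a Mahayuga divided by the number of revolutions. A minimal sketch, with the revolution numbers written in Western digit grouping and with hypothetical names (CIVIL_DAYS, REVOLUTIONS, sidereal_period_days):

```python
CIVIL_DAYS = 1_577_917_500   # Yugasavanadina of Aryabhatiya (1,57,79,17,500)

# Revolutions in a Mahayuga, as given in Aryabhatiya.
REVOLUTIONS = {
    "Sun": 4_320_000,
    "Moon": 57_753_336,
    "Mars": 2_296_824,
    "Jupiter": 364_224,
    "Saturn": 146_564,
}


def sidereal_period_days(revolutions, civil_days=CIVIL_DAYS):
    """Mean time for one revolution against the stars, in civil days."""
    return civil_days / revolutions


for name, revs in REVOLUTIONS.items():
    print(f"{name:8s} {sidereal_period_days(revs):.5f}")
```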

From the above data, the mean longitudes of the planets can be calculated for any 
time. Normally, it is assumed that the mean longitudes are zero at the beginning of 
kaliyuga. In Aryabhatiya, this is taken to be the mean sunrise at Ujjain on February 
18, BC 3102. 
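
The computation just described can be sketched as follows. This is an illustration under stated assumptions, not the procedure of any particular text: the function name mean_longitude_deg is hypothetical, days_elapsed stands for the count of civil days since the epoch, and epoch_longitude_deg defaults to zero as in Aryabhatiya.

```python
from fractions import Fraction

CIVIL_DAYS = 1_577_917_500   # civil days in a Mahayuga (Aryabhatiya)


def mean_longitude_deg(revolutions, days_elapsed, epoch_longitude_deg=0):
    """Mean longitude in degrees after days_elapsed civil days from the epoch."""
    turns = Fraction(revolutions, CIVIL_DAYS) * days_elapsed
    fractional_turn = turns % 1          # discard completed revolutions
    return (epoch_longitude_deg + 360 * fractional_turn) % 360


# Mean Moon 1000 civil days after the epoch (illustrative input).
print(float(mean_longitude_deg(57_753_336, 1000)))
```

Only the fractional part of the accumulated revolutions matters, which is why the completed revolutions are discarded before converting to degrees.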

Now, the apparent motion of the Sun, Moon and planets in the background 
of stars is not uniform. The planets (including the earth) and the Moon move in 
elliptical orbits around the Sun, and the earth respectively. The ellipticity of the 
orbit gives rise to the non-uniform nature of the motion. This is essentially Kepler's 
picture. In the geocentric frame of reference, the Sun moves around the earth, and 
the geocentric longitude of the Sun would be the heliocentric longitude of the earth 
increased by 180°. The geocentric longitudes of the planets would be related to their 
heliocentric longitudes and the geocentric longitude of the Sun. 

Even in ancient times, the predictions of planetary positions were fairly
accurate. This means that the ancient models would have in some sense been 
approximations of the Kepler model in the geocentric frame [13]. In the Indian 
planetary theories, two corrections were applied to the mean planet to obtain the 
‘true’ geocentric longitudes, and they are described below: 


(1) Manda-samskara: This takes into account the non-uniformity of motion 
resulting from the eccentricity of the planet’s orbit. An epicyclic or eccen- 
tric model is used to obtain the manda-correction. This corresponds to the 
‘Equation of centre’ in modern terminology. The manda-corrected mean 
planet is called mandasphuta-graha or simply mandasphuta. This is the 
true geocentric longitude in the case of the Sun. For the Moon, a second 
correction has been specified in some later texts to obtain the geocentric 
longitude, which will be discussed later. In the case of the planets called 
taragraha-s (traditionally, only Mercury, Venus, Mars, Jupiter and Saturn), 
the mandasphuta is the true heliocentric longitude. 


(2) Sighra-samskara: For the planets, one more correction, namely sighra, has 
to be applied to obtain the true longitude called sphutagraha. The sighra- 
samskara converts the heliocentric longitudes of the planets to geocentric 
longitudes. 


In Aryabhatiya, the above two corrections have been discussed clearly. The 
planetary model described by Aryabhata roughly amounts to the planets orbiting 
around the Sun in eccentric orbits, with the Sun itself orbiting around the earth, 
though Aryabhata does not state it as such. We discuss the Indian planetary models, 
in particular the traditional confusion in taking the mean Sun as the mean planet in 
the case of the interior planets, Mercury and Venus, later. Significantly, the picture 
of latitudes of planets is broadly correct in Aryabhatiya. 

We may mention here some of the later astronomers and their major
works [12]: Varahamihira (b. AD 505) (Pancasiddhantika), Bhaskara I
(7th century) (Mahabhaskariya, Aryabhatiya-bhasya), Brahmagupta (7th century)
(Brahmasphutasiddhanta), Lalla (8th-9th century) (Sisyadhivrddhida-tantra),
Vatesvara (9th century) (Vatesvarasiddhanta), Manjulacarya (10th century)
(Laghumanasa), Sripati (11th century) (Siddhantasekhara), Bhaskaracarya II (b.
AD 1114) (Siddhantasiromani with Vasanabhasya). It is in Laghumanasa that
the second correction for the Moon corresponding to ‘Evection’ makes its first
appearance in the Indian tradition. Bhaskara II has also included this correction
for the Moon. This correction arises due to the influence of the Sun on the lunar
orbit.

Siddhantasiromani is a compendium which contains most of the results known
to Indian astronomers at that time. After Bhaskara II, there was very significant
work in astronomy in Kerala during the 14th-17th centuries, which also led to
some major innovations in planetary theory. We will discuss them in the next
section.


3. Tantrasangraha and Yukti-bhasa 


Kerala was a stronghold of the Aryabhatan school of astronomy [14]. Aryabhatiya
became popular in Kerala from early times. Madhava of Sangamagrama (1340-1425)
seems to have been the founder of the Kerala school. His known works are few
in number. However, most of the distinctive contributions of the Kerala school
in mathematical analysis, as well as in planetary computations, are invariably
attributed to him by the later astronomers from Kerala. Paramesvara of Vatasseri
(c. 1360-1455), Nilakantha Somayaji or Somasutvan of Trikkantiyur (c.1444-1550)
and Jyesthadeva of Parakroda or Parannottu family (c.1500-1610), are some of the
major figures of this school. Emphasizing the need for revising the planetary
parameters through observations, Paramesvara introduced the Drgganita system. Apart
from Drgganita, his other major works are Bhatadipika which is a commentary on
Aryabhatiya, Siddhantadipika, and Grahanamandana on eclipses. He was one of
the first astronomers to discuss the geometrical model of planetary motion in some
detail.

Tantrasangraha, composed in AD 1500, is Nilakantha's major work. 
Aryabhatiya-bhasya composed by him late in his life is perhaps the most elab- 
orate commentary on Aryabhatiya, and is yet to be translated and studied in depth. 
Jyotirmimamsa, Siddhanta-darpana and Golasara are some of his other important 
works. Jyesthadeva is the author of Ganita-yukti-bhasa. He authored one more work 
called Drkkarna. 


Tantrasangraha 


Tantrasangraha is a comprehensive text which discusses all aspects of
mathematical astronomy such as the computation of longitudes and latitudes of
planets, various diurnal problems, determination of time, eclipses, visibility of planets
etc. However, explanations are not provided, save on some rare occasions, as this
belongs to the Tantra class of texts which are intended to be computational in
nature. The explanations of the algorithms are to be found in Yukti-bhasa, and
in the two commentaries of the text, namely Laghuvivrti and Yukti-dipika, both
by Sankara Variyar of Trikkutaveli (c.1500-1562). In what follows, we
summarise those portions of Tantrasangraha which deal with planetary and lunar
theory.

The revolution numbers of the Sun, Moon and the five planets given in this text 
differ little from the Aryabhatan values summarised in Table 1. The yugasavanadina
or the number of civil days in a yuga is also the same as in Aryabhatiya. However, it
is noteworthy that while specifying the number of revolutions of the interior planets,
Mercury and Venus, the word svaparyayah is used, which clearly means that the 
revolution numbers given refer to their own revolutions, and not to their sighroccas, 
as specified in the earlier texts. This is of great significance in the computation of 
longitudes of the inner planets, and will be explained later. 

The revolution numbers of the apsides and nodes of the planets which are needed 
for calculating the true longitudes and latitudes are also given. The mean longitudes 
at the beginning of kaliyuga are not assumed to be zero, as in Aryabhatiya, and their
corrected values are specified. The mean longitude of a planet can be calculated 
at any given time, given these epochal values, the revolution numbers and the 
yugasavanadina. 

In the earlier Indian texts, for the interior planets Mercury and Venus, the 
equation-of-centre-correction was wrongly applied to the mean Sun, instead of 
the mean heliocentric planet. This is true of Ptolemy's Almagest also, and clearly 
this correction does not correspond to anything physical. It was Nilakantha who 
set this right in Tantrasangraha. Here the manda correction is applied to the mean 
heliocentric planet for the interior planets also, just as in the case of exterior planets. 
This departure, as well as a clear analysis of the latitudinal motion, led Nilakantha 
to propose a geometrical picture of planetary motion in his other works. According 
to this, the planets move in eccentric orbits around the mean Sun, which itself moves 
around the earth [7,8]. 

The actual distances of the Sun and the Moon from the centre of the earth play 
an important role in eclipse calculations, as these determine the apparent sizes 
of the solar and lunar discs, as well as their parallaxes. This requires accurate 
computations of the positions of the Sun and the Moon. It is in this context that 
the second correction for the Moon is discussed in Tantrasangraha. There is a
similar correction for the Sun, which does not alter its longitude, but modifies its
distance very marginally. The mean distances are specified in the text. The true
distance called dvitiya-sphuta-yojana-karna is to be computed by including the
manda correction and the above second correction. 


Yukti-bhasa 


It is very significant that Yukti-bhasa is written in Malayalam, the local language 
of Kerala. It is really a textbook for students as it systematically develops the 
relevant mathematics and astronomy, and is self-contained. The basics are set forth 
at the beginning of each topic and all the results which follow are proved. The 
main concepts are often repeated for emphasis. Alternate methods for important 
formulae are given, wherever possible. There are no equations as we write now, nor 
do we find any diagrams in the available manuscripts. However, from the detailed 
descriptions given in the text, one can write down the equations, as also draw the 
figures without ambiguity. 

Yukti-bhasa can be naturally divided into mathematics and astronomy portions.
The mathematics part is really an independent treatise. A substantial part of
mathematics known to Indians at that time is included and explained in the work. The
famous results on the infinite series for π, inverse tangent, sine and cosine functions
are all proved here. It also includes a derivation of the surface area and volume of a
sphere using methods which amount to carrying out integrations.

The astronomy part is very comprehensive, and includes almost all that was 
known in India then, save some topics like instruments. The author's stated aim is 
to explain all the algorithms in Tantrasangraha. 

The first chapter in the astronomy part is on planetary theory. The epicyclic and 
eccentric theories are described for both manda-samskara (equation of centre) and 
Sighra-samskara (conversion from heliocentric to geocentric coordinates). The 
correct formulation of the equation of centre for interior planets is presented and 
the way it differs from the earlier formulations is noted. The geometrical model 
of Nilakantha is described in detail. The computation of geocentric latitudes is 
also described. The orbit of a planet has to be projected onto the plane of the 
ecliptic to obtain the longitude. This is explained. There is an interesting discussion 
on how one can obtain the geocentric coordinates of a celestial body, given the 
luni-centric coordinates. This indicates how comfortable Jyesthadeva was with 
coordinate transformations while going from one frame of reference to another, in 
the context of astronomy. 

The second correction for the Moon is dealt with in a later chapter. The
Tantrasangraha results for this correction as well as the distance of the Moon
are explained with a geometrical model.


4. Planetary Model in Yukti-bhasa 


Manda-Samskara: Correction Due to Eccentricity of the Orbit in Indian
Planetary Models


We consider the explicit geometrical construction for computing the correction 
due to eccentricity described in Yukti-bhasa [15]. Essentially the same form of the 
correction term has been described right from Aryabhatiya down to Tantrasangraha.
The speciality of Yukti-bhasa lies in its detailed geometrical description.

In Figure 1, the dashed circle, with O as the centre, is the deferent circle, termed
‘kaksyavrtta’. Its radius is ‘trijya’, denoted by R. In the Indian texts, the distances in
diagrams associated with planetary motion are expressed in minutes. Normally, the
circumference of the deferent circle is taken to be 21,600 minutes, and thus the radius
R would be the number of minutes in a radian, which is nearly 3438. Throughout
this article, this is the value of R, wherever it appears. A represents the direction of
the apside (‘mandocca’) and let $\Gamma\hat{O}A = \varpi$. The mean planet $P_0$ moves along the
deferent circle uniformly. The mean longitude is called ‘madhyamagraha’ and is
given by $\theta_0 = \Gamma\hat{O}P_0$.² Construct a circle of radius r with $P_0$ as the centre. This is the
‘manda’-epicycle (‘manda-nicocca-vrtta’ or ‘manda-vrtta’) on which the ‘manda’-corrected
planet (‘mandasphuta’) is located. Draw a line from $P_0$ parallel to OA,
which intersects the epicycle at P. This is the location of the manda-corrected planet
whose longitude is given by $\theta_{ms} = \Gamma\hat{O}P$. This is the epicycle model.

[Figure 1: Manda-samskara or the correction due to eccentricity of the orbit.
Labels in the figure: Pratimandala or Grahavrtta; Kakshyavrtta.]

Alternatively, locate O' on OA such that OO' = r. The circle with O as the
centre and r as the radius is also known as manda-epicycle. Draw a circle with radius
R with O' as the centre. This is the eccentric circle, called the ‘pratimandala’. It
is easy to see that the manda-corrected planet, P, is located on this eccentric circle
such that $\Gamma\hat{O}'P = \Gamma\hat{O}P_0 = \theta_0$. That is, the planet moves uniformly around O’,
and not around O with respect to which the longitude is measured. This is the
‘eccentric circle’ model. Clearly, it is equivalent to the ‘epicycle model’.
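
The equivalence of the two constructions can be seen by treating points of the plane as complex numbers with O at the origin. This is only an illustrative sketch: the function names, the use of radians, and the sample inputs are assumptions made here. In the epicycle construction one goes a distance R in the direction of the mean longitude and then r parallel to OA; in the eccentric construction the same two displacements are taken in the opposite order.

```python
import cmath
import math

R = 21600 / (2 * math.pi)   # trijya in minutes of arc, nearly 3438


def epicycle_position(mean_longitude, apside, r):
    """P reached from the mean planet P0 by a step r parallel to OA."""
    p0 = cmath.rect(R, mean_longitude)      # mean planet on the deferent
    return p0 + cmath.rect(r, apside)


def eccentric_position(mean_longitude, apside, r):
    """P on the circle of radius R centred at O', where OO' = r along OA."""
    o_prime = cmath.rect(r, apside)
    return o_prime + cmath.rect(R, mean_longitude)


# Illustrative inputs in radians; r is an arbitrary small epicycle radius.
p = epicycle_position(1.2, 0.4, 130.0)
q = eccentric_position(1.2, 0.4, 130.0)
print(abs(p - q), math.degrees(cmath.phase(p)))
```

The argument of the complex number P is the manda-corrected longitude as seen from O.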


2 In this article, all the longitudes and latitudes are with respect to the ecliptic. For calculating the
planetary positions, the zero point for the longitudes is the ‘mesadi’ or the ‘first point of Aries’, denoted
by Γ in this article. In the Indian texts, the zodiacal signs are fixed in the background of stars and the
corresponding longitudes are the nirayana ones. The ‘tropical longitudes’ which are qualified as sayana,
corresponding to the vernal equinox as the zero point, are also used, particularly in the treatment of
diurnal problems. The difference between these two longitudes is due to the ‘precession of equinoxes’,
and does not affect any of the arguments in the present article.

Draw PQ perpendicular to (extended) $OP_0$. Now, $PP_0 = r$. $P\hat{P}_0Q = \Gamma\hat{O}P_0 - \Gamma\hat{O}A
= \theta_0 - \varpi = M$ is the anomaly or manda-kendra, and $P\hat{O}P_0 = \Gamma\hat{O}P_0 - \Gamma\hat{O}P
= \theta_0 - \theta_{ms}$. Now

$$PQ = r \sin(P\hat{P}_0Q) = OP \sin(P\hat{O}P_0). \qquad (1)$$

K = OP is the ‘manda’-hypotenuse, called ‘mandakarna’. It is easily seen that

$$K = [(R + r\cos M)^2 + r^2 \sin^2 M]^{1/2}. \qquad (2)$$


Yukti-bhasa follows the standard rule adopted in most texts of Indian astronomy
(such as Aryabhatiya, Brahmasphutasiddhanta, Tantrasangraha, etc.), that r is not
a constant but proportional to K, such that

$$\frac{r}{K} = \frac{r_0}{R}, \qquad (3)$$

where $r_0$ is prescribed in the text for each planet. $r_0$ is essentially the mean radius of
the manda-epicycle, and could itself depend on the anomaly for some planets. Now
K depends on r, and r is proportional to K. In Tantrasangraha and Yukti-bhasa, an
iterative procedure is given to find K. K can be found non-iteratively also, using an
elegant geometrical method [16]. The exact expression for the manda-hypotenuse,
K, is:


$$K = \frac{R^2}{\sqrt{R^2 - r_0^2 \sin^2 M} - r_0 \cos M}. \qquad (4)$$

Thus K is determined in terms of known quantities. This exact expression is
attributed to Madhava by Nilakantha. As r is not a constant, the planet's orbit is not a
circle, but some kind of an oval.

It would be useful to summarise the definitions of various relevant quantities at
this stage:

mean longitude (madhyama-graha) = $\Gamma\hat{O}P_0 = \Gamma\hat{O}'P = \theta_0$,
apside (mandocca) = $\Gamma\hat{O}A = \varpi$,
anomaly (manda-kendra) = $A\hat{O}P_0 = P\hat{P}_0Q = \theta_0 - \varpi = M$,
manda-hypotenuse (manda-karna), K = OP,
manda-corrected longitude (manda-sphuta), $\theta_{ms} = \Gamma\hat{O}P$,
manda-correction, manda-phala = $\theta_{ms} - \theta_0$.
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Taking (3) into account, (1) gives 


ro sin(@yy — u) ART sin M (5) 
R m RE » 


sin(ys — m) = — 


Thus, one does not need to know the manda-hypotenuse in order to find the 
correction Qs — Oy. As the mean longitude, Om which varies linearly with time 
can be readily found at any instant, and apside, w is known, the ‘manda’ -correction 
(‘mandaphaia’), Oms — Om can be found from the above equation. Adding this to 
the mean longitude 0x, we obtain the ‘manda’-corrected planet, Oms. This is the 
true longitude or the *spAiutagraha! in the case of the Sun, and essentially so also 
for the Moon. 
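The relations (2) to (5) can be checked numerically. The following is only a minimal sketch, not the procedure of the texts themselves: the function names are hypothetical, the iteration is a plain fixed-point loop standing in for the iterative procedure mentioned above, and the value $r_0/R = 7/80$ used in the usage note is the Moon's value quoted later in this article.

```python
import math

R = 3438.0  # trijya, in minutes of arc


def manda_hypotenuse_iterative(r0, M, tol=1e-9, max_iter=100):
    """Fixed-point iteration on K: insert r = r0*K/R (eq. 3) into eq. (2)."""
    K = R  # starting guess: the radius of the eccentric circle
    for _ in range(max_iter):
        r = r0 * K / R
        K_new = math.hypot(R + r * math.cos(M), r * math.sin(M))
        if abs(K_new - K) < tol:
            return K_new
        K = K_new
    return K


def manda_hypotenuse_exact(r0, M):
    """Closed form of eq. (4), attributed to Madhava."""
    return R**2 / (math.sqrt(R**2 - (r0 * math.sin(M))**2) - r0 * math.cos(M))


def manda_phala(r0, M):
    """Manda-correction theta_MS - theta_M in radians, from eq. (5)."""
    return -math.asin(r0 / R * math.sin(M))
```

With `r0 = 7 / 80 * R`, the two hypotenuse functions should agree for any anomaly `M` (in radians), and `manda_phala` needs no hypotenuse at all, which is the point made in the text.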

The manda-corrected planet is the true heliocentric longitude for the planets
Mercury, Venus, Mars, Jupiter and Saturn, called 'taragraha's, as we will see shortly.
One more correction, namely the 'sighra'-correction ('sighra-samskara'), has to be
applied to them to obtain their geocentric longitudes.

When $r_0/R \ll 1$, $\sin(\theta_{MS} - \theta_M) \approx \theta_{MS} - \theta_M$ and (5) reduces to

$$\theta_{MS} - \theta_M = -\frac{r_0}{R}\sin(\theta_M - \varpi) = -\frac{r_0}{R}\sin M. \tag{6}$$

Comparing this with (35) in Appendix 1, it can be seen that this has the same form
as the equation of centre correct to lowest order in eccentricity in Kepler's model,
with $r_0/R$ playing the role of the eccentricity, $e$. This equivalence holds only to the first order
in $e$. The equation of centre has different forms at higher orders of $e$ in Kepler's
model and in the Indian models.

In Ptolemy's model for the equation of centre, the planet is indeed taken to be moving on
the 'eccentric circle' with $O'$ as the centre, but it moves uniformly
not with respect to $O'$, but with respect to a point which is at a distance $2\,OO'$ from
$O$ in the direction of $O'$ [17]. This point is called the 'equant'. Ptolemy's model
would also be equivalent to Kepler's model to the first order in eccentricity.


Sighra-samskara: Conversion to the Geocentric Frame 


Exterior Planets 


The following explicit geometrical construction is described in Yukti-bhasa to obtain
the geocentric longitude of an exterior planet [18]. The final formula is essentially
the same as in the earlier texts.

Figure 2: Sighra-samskara for exterior planets to obtain the geocentric
longitude.

In Figure 2, $O$ is the centre of the earth ('bhagola-madhya'). The 'sighra' epicycle
('sighra-nicocca-vrtta', or 'sighra-vrtta') is a circle with $O$ as the centre, and
whose radius, $r_s$, is given. The 'sighrocca', $S$, is located on this circle. It is also
stated that the sighrocca is the mean Sun ('aditya-madhyama'). The manda-epicycle
is a circle with the sighrocca as the centre. The planet's apside (mandocca), $U$,
is located on this circle. The planet $P$ is located on the eccentric circle which is
centred at $U$. $K = SP$ is the manda-hypotenuse. $\angle\Gamma SP$ is the manda-corrected
planet, which is the true heliocentric longitude. $\angle PSB$, which is the difference
between this and the sighrocca, is known as the sighra-anomaly (sighra-kendra). $\angle\Gamma OP$
is the 'sighra'-corrected planet ('sighra-sphuta'). The sighrocca, $S$, is the mean
Sun, and the sighra-epicycle represents its orbit around the earth, $O$. The manda-corrected
planet is the true heliocentric longitude of the planet around the mean Sun,
and the sighra-corrected planet, $\angle\Gamma OP$, is the geocentric longitude of the planet
with respect to the earth, $O$. It is in this sense that the sighra-samskara is a coordinate
transformation, which converts the heliocentric longitude to the geocentric
longitude.

The sighra-corrected planet is found from the manda-corrected planet in the same
manner as the latter is found from the mean planet. Thus it may be
noted that, in the computation of the sighra-corrected planet, the sighrocca and
the manda-hypotenuse play the same roles as the apside and the radius of
the eccentric circle ($R$) did in the computation of the manda-corrected planet. The
sighra-hypotenuse, $K_s = OP$, can be determined in terms of $SP = K$. Apart from
the similarities, there is one difference. In the manda-correction, we had noted that
the radius of the epicycle $r$ increases or decreases in the same way as $K$. In the
sighra-correction, the radius $OS = r_s$ does not vary with the sighra-hypotenuse,
$K_s$. To start with, both the mean radius $r_0$ of the manda-epicycle and the radius $r_s$ of
the sighra-epicycle are specified in the measure of the radius of the eccentric circle,
namely the trijya, $R = 3438'$.

We now employ the following notation:

mean longitude (madhyama-graha) $= \angle\Gamma UP = \theta_M$,
manda-corrected planet (manda-sphuta) $= \angle\Gamma SP = \theta_{MS}$,
manda-hypotenuse (manda-karna) $= SP = K$,
radius of the manda-epicycle $= US = r = r_0 K/R$,
mean Sun (sighrocca) $= \angle\Gamma OS = \theta_S$,
radius of the sighra-epicycle $= OS = r_s$,
sighra-anomaly (sighra-kendra) $= \angle PSB = \angle\Gamma SP - \angle\Gamma SB = \theta_{MS} - \theta_S$,
sighra-hypotenuse (sighra-karna) $= OP = K_s$, (7)
sighra-corrected planet (sighra-sphuta) $= \angle\Gamma OP = \theta_g$. (8)


Here the manda-corrected planet, $\theta_{MS}$, and the manda-hypotenuse, $K$, are determined
in terms of the mean longitude ($\theta_M$), the apside ($\varpi$), and the prescribed value of
$r_0$, as described in the previous sub-section.³ The following formula for the sighra-corrected
longitude $\theta_g$ can be easily derived from the geometrical construction in Figure 2:

$$\sin(\theta_g - \theta_{MS}) = -\frac{r_s\sin(\theta_{MS} - \theta_S)}{K_s}
= -\frac{r_s\sin(\theta_{MS} - \theta_S)}{\left[(K + r_s\cos(\theta_{MS} - \theta_S))^2 + r_s^2\sin^2(\theta_{MS} - \theta_S)\right]^{1/2}}. \tag{9}$$


This is the formula in earlier texts also, such as Suryasiddhanta, Mahabhaskariya,
etc. This formula of Yukti-bhasa is exactly the same as would follow from
the Keplerian model (see (41) of Appendix 1) for an exterior planet, when $\theta_{MS}$
in the former is identified with the true heliocentric longitude in the latter ($\theta_h$),
and $r_s/K$ in the former is identified with the ratio of the Earth-Sun and Sun-planet
distances in the latter. The identification of $\theta_{MS}$
with $\theta_h$ is clear enough, as $\theta_{MS}$ is the heliocentric mean planet to which the correction
due to eccentricity has been added. There is a small error as $\theta_S$ is the
mean Sun and not the true Sun. This error is present in Ptolemy's Almagest [2]
also, and persists even in de Revolutionibus of Copernicus [3]. In Table 2, we
compare the average values of $r_s/R$, which are the ratios of the Earth-Sun and Sun-planet
distances in the geometrical model, with the ratios of the actual physical
values of these. There is reasonable agreement between these
values.
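The construction of Figure 2 amounts to adding the vector $OS$ (length $r_s$, direction $\theta_S$) to the vector $SP$ (length $K$, direction $\theta_{MS}$). The sketch below, with hypothetical function names, evaluates the sine formula for the sighra-correction and compares it with the direct vector sum; the numerical inputs suggested in the usage note are illustrative assumptions, not values from the texts.

```python
import cmath
import math


def sighra_exterior(theta_ms, theta_s, K, r_s):
    """Sighra-corrected longitude and sighra-hypotenuse of an exterior planet.

    Angles are in radians. The arcsine form is valid when the
    correction stays within 90 degrees, which holds for r_s < K.
    """
    sigma = theta_ms - theta_s  # sighra-anomaly (sighra-kendra)
    K_s = math.hypot(K + r_s * math.cos(sigma), r_s * math.sin(sigma))
    theta_g = theta_ms - math.asin(r_s * math.sin(sigma) / K_s)
    return theta_g, K_s


def geocentric_by_vectors(theta_ms, theta_s, K, r_s):
    """Same quantities from the vector sum OP = OS + SP."""
    OP = cmath.rect(r_s, theta_s) + cmath.rect(K, theta_ms)
    return cmath.phase(OP), abs(OP)
```

For instance, with `K = 3438.0`, `r_s = 0.656 * 3438.0` (a Mars-like ratio), and longitudes of 100 and 40 degrees converted to radians, both functions should return the same pair, provided the resulting longitude lies within the principal range of `cmath.phase`.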


³Note that the radius of the manda-epicycle, $US$, is itself $r = r_0 K/R$.




Figure 3: Sighra-samskara for interior planets to obtain the geocentric longitude, 
in Aryabhatiya. 


Interior Planets 


In both the Indian and Greek traditions, the mean Sun was taken to be the mean
planet for the interior planets. This is due to the fact that the directions of Mercury
and Venus are close to that of the Sun, as viewed from the earth. So, their geocentric
longitudes would not be very different from the longitude of the Sun. The procedure
for the computation of the geocentric longitude of an interior planet in Aryabhatiya
and other texts prior to Tantrasangraha is represented geometrically in Figure 3.
Here $S$ is the mean Sun moving in a circle around $U$, which is the apside of the
planet, where $OU$ is equal to the radius of the manda-epicycle of the planet. Thus,
$S$ is nothing but the mean Sun, to which the manda-correction pertaining to the
planet has been applied. $P$ is the sighrocca, which is actually the mean heliocentric
planet, moving on the sighra-epicycle, which is the planet's orbit around $S$.⁴ Then
what is computed as the geocentric longitude of the inner planet in the earlier
texts is actually $\angle\Gamma OP$. This is obviously wrong. The manda-corrected planet $\angle\Gamma OS$
here does not correspond to anything physical. The same mistake was committed
by Ptolemy in his Almagest [2] and even by Copernicus [3], who seems to have
followed Ptolemy closely.

⁴In Aryabhatiya and other ancient texts, for the interior planets, the sighrocca and the sighra-epicycle
represent entirely different things, when contrasted with the same objects for exterior planets. For the
latter, the sighrocca was the mean Sun and the sighra-epicycle was the mean Sun's orbit around the earth.

Figure 4: Sighra-samskara for interior planets in Tantrasangraha and Yukti-bhasa.

Nilakantha Somayaji seems to have been the first astronomer who revised this
picture in Tantrasangraha and his other works [7,8]. His revised formulation is what
is described in Yukti-bhasa [19] and depicted here in Figure 4. Nilakantha has a
unified formulation for the interior and exterior planets, where the true heliocentric
longitude is converted to the geocentric longitude correctly. Here the mean Sun $S$ is
the sighrocca, in the same way as for exterior planets. $S$ moves on a circle of radius
$R$ with the centre of the earth, $O$, as the centre. This is the sighra-circle (sighra-vrtta
or the sighra-nicocca-vrtta) for the planet. The planet $P$ moves on an eccentric orbit
around $S$. The planet's distance from $S$, $SP$, is the manda-hypotenuse with its value
reduced from $K$ to $\tilde r_s = K r_s/R$.⁵ $\angle\Gamma SP$ is the manda-corrected planet, which is the
longitude of $P$ measured with respect to $S$. This is the true heliocentric longitude of
the planet. In Figure 4, we have also shown the apside (mandocca), $U$. $UP = r_s$, and
$US = \frac{r_s}{R}\,r$, which is the reduced radius of the manda-epicycle. The smaller circle
in the figure is the instantaneous orbit of $P$ around $S$, whose radius is $\tilde r_s$. $\angle\Gamma OP$ is the
sighra-corrected planet, which is the geocentric longitude. We now summarise the
various relevant definitions:


mean planet (madhyama-graha) $= \angle\Gamma UP = \theta_M$,
mean radius of the planet's orbit $= UP = r_s$,
mean Sun (sighrocca) $= \angle\Gamma OS = \theta_S$,
manda-corrected planet (manda-sphuta) $= \angle\Gamma SP = \theta_{MS}$,
planet's distance from $S$, reduced manda-hypotenuse $= SP = \tilde r_s = r_s K/R$,
distance between apside and sighrocca $= US = r\,r_s/R$,
sighra-anomaly (sighra-kendra) $= \angle PSB = \angle\Gamma SP - \angle\Gamma SB = \theta_{MS} - \theta_S$,
sighra-hypotenuse (sighra-karna) $= OP = K_s$, (10)
sighra-corrected planet (sighra-sphuta) $= \angle\Gamma OP = \theta_g$. (11)

⁵The determination of the manda-hypotenuse, $K$, has been explained above in the sub-section on
manda-samskara.


In the case of the exterior planet, the mean Sun, which is the sighrocca, moves in
a smaller circle of radius $r_s$ and the planet moves in a circle of radius equal to the trijya,
$R$, around the apside. Here for an interior planet, the mean Sun is the sighrocca,
which moves in the bigger circle of radius $R$, and the planet moves in an eccentric
orbit, such that it is at a distance $r_s$ from $U$, and $\tilde r_s$ from $S$. The manda-corrected
planet, $\theta_{MS}$, is calculated from $\theta_M$ in the usual fashion. The sighra-hypotenuse, $K_s$,
is given by

$$K_s = \left[(R + \tilde r_s\cos(\theta_{MS} - \theta_S))^2 + \tilde r_s^2\sin^2(\theta_{MS} - \theta_S)\right]^{1/2}. \tag{12}$$

Then the geometry of Figure 4 implies the following formula for the true geocentric
longitude, $\theta_g$:

$$\sin(\theta_g - \theta_S) = \frac{\tilde r_s\sin(\theta_{MS} - \theta_S)}{K_s}
= \frac{\tilde r_s\sin(\theta_{MS} - \theta_S)}{\left[(R + \tilde r_s\cos(\theta_{MS} - \theta_S))^2 + \tilde r_s^2\sin^2(\theta_{MS} - \theta_S)\right]^{1/2}}. \tag{13}$$


This is the same as (46) in Appendix 1 for the geocentric longitude of an interior
planet in Kepler's model, if we identify $\theta_{MS}$ here with the true heliocentric longitude
$\theta_h$ there, and $\tilde r_s/R$ in the above geometrical picture with the ratio of the Sun-planet
and Earth-Sun distances in Kepler's model. Again,
it is clear from the figure that $\theta_{MS}$ is the true heliocentric longitude, if we ignore the
difference between the mean Sun and the true Sun. Ignoring the eccentricity of the
inner planet's orbit, we compare the average values of $r_s/R$, which are the ratios of the
Sun-planet and Earth-Sun distances in the Tantrasangraha model, with the ratios
of the modern physical values of these in Table 2. There is reasonable agreement,
just as for the exterior planets.
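For an interior planet the roles of the two circles are exchanged: the vector $OS$ now has length $R$ and the vector $SP$ has the reduced length $\tilde r_s = r_s K/R$. A minimal sketch of the sighra-hypotenuse and sighra-correction formulas for this case follows; the function name is hypothetical and the sample numbers in the usage note are illustrative assumptions only.

```python
import math

R = 3438.0  # trijya, in minutes of arc


def sighra_interior(theta_ms, theta_s, K, r_s):
    """Geocentric longitude and sighra-hypotenuse of an interior planet.

    Angles are in radians. K is the manda-hypotenuse and r_s the mean
    radius of the planet's orbit, both in the measure R = 3438.
    """
    r_tilde = r_s * K / R                    # reduced manda-hypotenuse SP
    sigma = theta_ms - theta_s               # sighra-anomaly
    K_s = math.hypot(R + r_tilde * math.cos(sigma),
                     r_tilde * math.sin(sigma))
    theta_g = theta_s + math.asin(r_tilde * math.sin(sigma) / K_s)
    return theta_g, K_s
```

Since $\tilde r_s < R$, the geocentric longitude never departs from the mean Sun by more than $\sin^{-1}(\tilde r_s/R)$, which is the bounded elongation expected of Mercury and Venus. Calling `sighra_interior(math.radians(110), math.radians(70), 3400.0, 0.723 * R)` gives a Venus-like example.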


Table 2: Comparison of $r_s/R$ in Tantrasangraha for
sighra-samskara with the modern values of the mean ratio
of Earth-Sun and planet-Sun distances for exterior planets
and the inverse ratio in the case of the interior planets.

Planet      Tantrasangraha value    Modern value
Mercury                             0.387
Venus                               0.723
Mars                                0.656
Jupiter                             0.192
Saturn                              0.105


Latitudes of Planets

Just as in the case of longitudes, we also have a unified theory of latitudes in
Yukti-bhasa [20]. The geometrical picture for the latitudinal deflection is described
as follows in the text [21].


“In the above set up, the sighra-vrtta is presumed with its centre at the centre
of the apakrama circle⁶ and its circumference along the marga (in the plane) of
the apakrama circle. It may be recalled that the apakrama circle near the centre is
called sighrocca-nica-vrtta. The size of the sighra-vrtta-s will be different for the
different planets. That is all the difference (between the sighra-vrtta-s) and there is
no difference in their placement as they are located the same way (i.e., with their
centre at the centre of the apakrama circle and also lying in the same plane).

“Now, the manda-nicocca-vrtta is a circle having its centre on the circumference
of the sighra-vrtta at the point where the mean Sun is. This is the case for all
(the planets). The ascending node (pata) has its motion along the circumference
of the manda-nicocca-vrtta in the retrograde manner. The point in the manda-nicocca-vrtta
where the pata is, will touch the apakrama-mandala. One half of the
manda-nicocca-vrtta, commencing from the pata, will lie on the northern side of
the apakrama-mandala. Again, the point which is six signs away from the pata
will touch the apakrama-mandala. The other half (of the manda-nicocca-vrtta)
will lie on the southern side of the apakrama-mandala. Here, that point, which is
displaced maximum from the (plane of the) apakrama-mandala, will indicate the
maximum viksepa (parama-viksepa) of the planets in terms of the minutes of arc
of their respective mandocca-vrtta-s. Further, the plane of this nicocca-vrtta itself
will be the plane of the pratimandala. Hence, the pratimandala too will be inclined
towards the north and south from the plane of apakrama-mandala in accordance
with the nicocca-vrtta. The manda-karna-vrtta⁷ will also be inclined accordingly.
Now, the viksepa has to be obtained from the manda-sphuta.

“Here, since the centre of the manda-karna-vrtta is the same as the centre of the
mandocca-vrtta and since it will be inclined to the plane of apakrama-mandala,
south and north, accordingly as the mandocca-vrtta, the maximum divergence of the
circumference of the manda-karna-vrtta from the plane of the apakrama-mandala
will be the maximum viksepa in the measure of the manda-karna-vrtta. Hence, if
the Rsine of the manda-sphuta minus pata is multiplied by the maximum viksepa
and divided by trijya, the result will be the ista-viksepa of the planet on the manda-karna-vrtta.
This inclination (deflection from the ecliptic) is called viksepa.”

⁶The ecliptic.
⁷A circle whose radius is the manda-hypotenuse.

Figure 5: The latitudinal deflection (viksepa) of a planet.

This has the following implication for the geometrical model for the planets. 
In Figure 2 (for the exterior planets) and Figure 4 (for the interior planets), the 
mean Sun, S (sighrocca) moves on the Sighra-circle in the plane of the ecliptic 
(apakrama-mandala), with the earth, $O$, as the centre. The planet, $P$, moves around
$S$ in an eccentric circle with the apside, $U$, as the centre, in a plane which is inclined
to the ecliptic. $SU$ and $UP$ are in this plane. First, let us consider the heliocentric
longitude. In Figure 5, O is the earth and S is the Sun moving around the earth in 
the plane of the ecliptic. P is the planet moving in a plane inclined to the ecliptic, 
and K = SP which lies in this plane is the manda-hypotenuse. The ecliptic, and 
the orbit of the planet intersect at N, which is the ascending node. 

The latitude of the planet is $\beta_h = PQ$, which is the arc perpendicular to the ecliptic
passing through $P$. $PB$ is perpendicular to the plane of the ecliptic. Then the
distance of the planet from the plane of the ecliptic, termed 'viksepa', is given to be

$$\text{viksepa} = v = PB = K\sin\beta_h. \tag{14}$$

$\beta_h$ itself is found from the following expression:

$$R\sin\beta_h = R\sin i\,\sin(\theta_{MS} - \theta_N),$$

where $\theta_{MS}$ is the true heliocentric longitude of the planet (manda-corrected planet),
$\theta_N$ is the longitude of the ascending node of the planet, and $i$ is the inclination of
the orbit. Figure 6 shows the relation between the geocentric latitude $\beta_g$ and the
heliocentric latitude $\beta_h$ as explained in Yukti-bhasa. We have

$$PB = OP\sin\beta_g = SP\sin\beta_h, \tag{15}$$

or,

$$\sin\beta_g = \frac{SP}{OP}\,\sin\beta_h. \tag{16}$$

When $\beta_g$ and $\beta_h$ are small, the above relation reduces to

$$\beta_g = \beta_h\,\frac{SP}{OP}. \tag{17}$$

Figure 6: Geocentric latitude, $\beta_g$.


$OP$ is the earth-planet distance (bhu-tara-graha-vivara), while $SP$ is the Sun-planet
distance. Equation (17) is exactly the same as the relation (51) in Appendix 1
on Kepler's model, relating the geocentric latitude $\beta_g$ and the heliocentric latitude
$\beta_h$. Here it may be pertinent to recall that the theory of latitude in Ptolemy's Almagest
was totally unsatisfactory, as it took the planes of the orbits to be passing through
the earth rather than the Sun [2]. The same was the problem with de Revolutionibus
of Copernicus also [3].
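The chain from the heliocentric to the geocentric latitude can be followed numerically. The sketch below uses hypothetical function names and takes the two distances $SP$ and $OP$ as given inputs; it is an illustration of the relations stated above under those assumptions, not a reconstruction of the computational rule in the text.

```python
import math


def heliocentric_latitude(theta_ms, theta_n, incl):
    """beta_h from sin(beta_h) = sin(i) * sin(theta_MS - theta_N); radians."""
    return math.asin(math.sin(incl) * math.sin(theta_ms - theta_n))


def geocentric_latitude(beta_h, SP, OP):
    """beta_g from eq. (16): sin(beta_g) = sin(beta_h) * SP / OP."""
    return math.asin(math.sin(beta_h) * SP / OP)


def viksepa(beta, distance):
    """Height PB above the ecliptic, as in eqs. (14) and (15)."""
    return distance * math.sin(beta)
```

By construction `viksepa(beta_h, SP)` and `viksepa(beta_g, OP)` coincide, which is the content of (15), and for small inclinations `geocentric_latitude` is close to `beta_h * SP / OP`, as in (17).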

The latitudinal deflection affects the longitude also. The manda-hypotenuse,
$SP = K$, should be projected onto a plane passing through the planet, $P$, which
is parallel to the ecliptic. This projection, $S'P = K\cos\beta_h$, is termed
viksepa-koti (see Figures 5 and 6). It is pointed out that the geocentric longitude,
the sighra-corrected planet, should be calculated taking the viksepa-koti as the manda-hypotenuse.
The result is the true planet (graha-sphuta). This is the 'reduction to
the ecliptic' which was taken note of by some of the Islamic astronomers and later
discussed by Tycho Brahe in the late sixteenth century.

In summary, we may note the following important points as regards the devel- 
opment of planetary theory in Kerala texts: 


1. The algorithms for the geocentric longitudes of planets given in Aryabhatiya
and other ancient texts more or less simulated the results of the Kepler model,
but for the fact that the equation of centre for interior planets was wrong.

2. Nilakantha's revised formulation of the equation of centre for interior planets
led him to a unified treatment of exterior and interior planets. The clearest
articulation of this revised model is to be found in Yukti-bhasa. The planets
move in eccentric orbits around the sighrocca (mean Sun), which itself goes
around the earth. The formulae for the geocentric longitudes and latitudes
of planets here are essentially the same as in Kepler's model.


3. Ptolemy's model for planetary motion is more complicated. Apart from the
error in the formulation of the equation of centre of interior planets, the theory
for planetary latitudes is totally off the mark in his Almagest, where Ptolemy
gives different models for exterior and interior planets. He revised his model
for latitudes in his Handy Tables, and later in the Planetary Hypotheses [22].
His formulation in the later work can be taken to be his last word on the
subject. Here, in the case of the exterior planets, the eccentric (around the
earth) is inclined to the ecliptic. The epicycle which represents the motion of
the Sun is parallel to the ecliptic. The model for the interior planets is more
complicated, but is a vast improvement over the same in Almagest. The final
results for the latitudes in Planetary Hypotheses are close to those following
from the modern heliocentric theory, though they are complicated, and are
not equivalent to Kepler's model in any simple manner.⁸ In fact, the same
is true with respect to the Copernican model also, revolutionary as it was in
treating the earth also as a planet.


4. It should be mentioned that the geometrical model described in Yukti-bhasa
is for each individual planet separately. It does not give a unified picture
involving all the planets, to a single scale.


5. Lunar Theory in Yukti-bhasa 


Yukti-bhasa describes a geometrical picture of the lunar motion which also incorporates
the 'second correction' or the 'evection' correction [23].⁹ This is indeed
a very interesting geometrical picture, according to which the centre of the earth
is not the same as that of the celestial sphere (bhagola), and the latter is in motion
with respect to the earth in such a way that it is always along the earth-Sun
direction or opposite to it, but at a variable distance. Now, the manda-corrected
Moon, calculated using the procedure described earlier, is taken to be the true
longitude with respect to the centre of the celestial sphere. A second correction
has to be applied to the manda-corrected Moon to obtain the true longitude of the
Moon with respect to the centre of the Earth, termed the 'second-corrected planet'
('dvitiya-sphuta').

⁸The Handy Tables and Planetary Hypotheses seem to have been totally ignored (at least as regards
the theory of latitudes) by all the astronomers in the Greco-European and Islamic traditions till the time
of Kepler [22].

⁹It may be noted that the geometrical picture of lunar motion presented in Yukti-bhasa is very different
from the model suggested recently by Duke [24], as a possible geometrical picture to understand the
second lunar correction discussed by Mañjulācārya in Laghumānasa.

Figure 7: Geometrical model for the second correction to the Moon
in Yukti-bhasa.

The procedure for the second correction is similar to the calculation of the
manda-correction, with the centre of the celestial sphere serving as the equivalent
of the apside, and called just 'ucca', which is taken to be in the direction of the Sun.
The distance between this and the centre of the Earth, which is the radius of the
epicycle, is posited to be a continuously varying quantity given by $\frac{R}{2}\cos(\theta_S - \varpi)$
in yojanas,¹⁰ where $\theta_S$ and $\varpi$ are the longitudes of the Sun and the apogee of the
Moon (candrocca), respectively.¹¹ According to the Aryabhatan school, the mean
distance between the Moon and the centre of the earth is $10R = 34380$ yojanas.
The actual distance between the Moon and the earth will be $10K$, where $K$ is the
manda-hypotenuse in minutes.

In what follows, we ignore the Moon's latitude. In Figure 7, $O$ is the centre of the
Earth, separated from the centre of the bhagola ($C$) by a distance

$$r = \frac{R}{2}\cos(\theta_S - \varpi) \quad \text{(in yojanas)}. \tag{18}$$

$\Gamma$ is the first point of Aries, and $\angle\Gamma OC = \theta_S$ (Sun's longitude). The Moon is at $P$.
Hence $\angle\Gamma CP = \theta_{MS}$ (manda-corrected Moon). Here $\theta_{MS}$ incorporates the correction
due to the equation of centre (manda-samskara), as described in section 4. The mean
longitude, $\theta_M$, and the apogee, $\varpi$, are not shown in the figure. $CP = D_1 = 10K$,
where $K$ is the manda-hypotenuse in minutes. It is clear that $\angle OCN = \theta_{MS} - \theta_S$.

¹⁰A unit of distance in Indian texts.
¹¹When $\cos(\theta_S - \varpi)$ is negative, the centre of the bhagola is at a distance $\left|\frac{R}{2}\cos(\theta_S - \varpi)\right|$ from
the centre of the earth, opposite to the direction of the Sun.

$OP$, the 'second-corrected hypotenuse' ('dvitiya-sphuta-karna') in yojanas, is
the true distance between the Moon and the centre of the Earth. The bhuja-phala,
$ON$, and the koti-phala, $CN$, are given by

$$ON = r\sin(\theta_{MS} - \theta_S) = \frac{R}{2}\cos(\theta_S - \varpi)\sin(\theta_{MS} - \theta_S), \tag{19}$$

$$CN = r\cos(\theta_{MS} - \theta_S) = \frac{R}{2}\cos(\theta_S - \varpi)\cos(\theta_{MS} - \theta_S). \tag{20}$$


Then, the true distance is given by

$$\begin{aligned}
OP = D_2 &= \sqrt{(PN)^2 + ON^2} \\
&= \sqrt{(PC + CN)^2 + ON^2} \\
&= \sqrt{(D_1 + \text{koti-phala})^2 + \text{bhuja-phala}^2} \\
&= \left[\left(D_1 + \frac{R}{2}\cos(\theta_S - \varpi)\cos(\theta_{MS} - \theta_S)\right)^2
 + \left(\frac{R}{2}\cos(\theta_S - \varpi)\sin(\theta_{MS} - \theta_S)\right)^2\right]^{1/2}.
\end{aligned} \tag{21}$$

Here

$$D_1 = 10K = \frac{10R^2}{\sqrt{R^2 - r_0^2\sin^2 M} - r_0\cos M}, \tag{22}$$

where $M = \theta_M - \varpi$ is the Moon's anomaly, and $r_0$ is the mean radius of the manda-epicycle
introduced in the discussion on the equation of centre in section 4.

We now explain how Yukti-bhasa uses the above geometrical picture of lunar
motion to incorporate the so-called 'evection' correction to the longitude of the
Moon. In Figure 7, $\theta_t = \angle\Gamma OP$ is the true longitude of the Moon with respect to
the centre of the earth. The true position of the Moon at $P$ can also be obtained by
drawing $OP' = D_1$ in the direction of the manda-corrected planet with respect to
$O$, and $P'P = r$ from $P'$, in the direction of the Sun.

Now

$$\angle OPC = \angle POP' = \angle\Gamma OP' - \angle\Gamma OP = \angle\Gamma CP - \angle\Gamma OP = \theta_{MS} - \theta_t.$$

Hence

$$\sin(\theta_{MS} - \theta_t) = \sin(\angle OPC) = \frac{ON}{OP}
= \frac{\frac{R}{2}\cos(\theta_S - \varpi)\sin(\theta_{MS} - \theta_S)}{D_2}, \tag{23}$$

where $D_2$ is given by (21). Hence

$$\begin{aligned}
\theta_t &= \theta_{MS} - \sin^{-1}\left[\frac{\frac{R}{2}\cos(\theta_S - \varpi)\sin(\theta_{MS} - \theta_S)}{D_2}\right] \\
&= \theta_M - \sin^{-1}\left(\frac{r_0\sin M}{R}\right)
 - \sin^{-1}\left[\frac{\frac{R}{2}\cos(\theta_S - \varpi)\sin(\theta_{MS} - \theta_S)}{D_2}\right].
\end{aligned} \tag{24}$$


Now when $x \ll 1$, $\sin^{-1}x \approx x$ (radians) $= 3438x$ (minutes). $r_0/R$ for the Moon is
specified to be $\frac{7}{80}$ in Tantrasangraha. Then to the first order in $r_0/R$ and
$\frac{R/2}{10R} = \frac{1}{20}$, we obtain the true longitude of the Moon to be

$$\theta_t \text{ (in minutes)} = \theta_M - 300.82'\sin M - 171.9'\cos(\theta_S - \varpi)\sin(\theta_{MS} - \theta_S), \tag{25}$$

where $\theta_M$ is the mean longitude of the Moon in minutes, the second term is the
usual equation of centre and the other term is the second correction as prescribed in
Tantrasangraha. As we mentioned earlier, the second correction appears already in
Laghumanasa [10]. There it is stated to be $-143'58''\cos(\theta_S - \varpi)\sin(\theta_{MS} - \theta_S)$. It
is clear that the lunar theory in Tantrasangraha/Yukti-bhasa is very similar to that
in Laghumanasa, though the parameter is altered.
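The two coefficients in (25) follow from simple arithmetic on $R = 3438$, $r_0/R = 7/80$ and the ratio $(R/2)/(10R)$. The sketch below makes this explicit and evaluates the first-order expression; the constant and function names are hypothetical, and the manda-corrected longitude inside the evection term is itself taken only to first order, which is an assumption of this sketch.

```python
import math

R = 3438.0               # trijya, in minutes of arc
R0_OVER_R = 7.0 / 80.0   # Moon's r0/R as specified in Tantrasangraha

MANDA_COEFF = R * R0_OVER_R              # coefficient of sin M, in minutes
EVECTION_COEFF = R * (R / 2) / (10 * R)  # R times 1/20, in minutes


def lunar_longitude_first_order(theta_m, varpi, theta_s):
    """Eq. (25). All angles in degrees; returns theta_t in degrees."""
    M = math.radians(theta_m - varpi)
    manda = MANDA_COEFF * math.sin(M)        # equation of centre, minutes
    theta_ms = theta_m - manda / 60.0        # first-order manda-sphuta
    evection = (EVECTION_COEFF
                * math.cos(math.radians(theta_s - varpi))
                * math.sin(math.radians(theta_ms - theta_s)))
    return theta_m - (manda + evection) / 60.0
```

`MANDA_COEFF` evaluates to about 300.825 and `EVECTION_COEFF` to about 171.9, the values quoted in (25).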

Now, according to the modern lunar theory, there are nearly 1500 terms contributing
to the Moon's longitude! Most of them are negligible. Considering only the major
terms, the Moon's true longitude, $\theta_t$, in minutes is given by the following expression
[25]:¹²

$$\begin{aligned}
\theta_t \text{ (in minutes)} = \theta_M &- 377.33'\sin M + 12.82'\sin(2M) \\
&- 76.43'\sin(2(\theta_M - \theta_S) - M) + 39.5'\sin(2(\theta_M - \theta_S)) \\
&+ 11.13'\sin(\theta_S - \varpi_S),
\end{aligned} \tag{26}$$

where $\varpi_S$ is the longitude of the Sun's apogee. In the RHS, the second and third terms
represent the 'Equation of centre', and the fourth, fifth and sixth ones are the
'Evection', 'Variation', and 'Annual inequality' respectively. Part of the second term
and the fourth term can be combined using the relation:

$$\sin A + \sin B = 2\sin\left(\frac{A+B}{2}\right)\cos\left(\frac{A-B}{2}\right).$$

Then, writing only the terms corresponding to the two inequalities, namely, 'equation
of centre' and 'evection',

$$\begin{aligned}
\theta_t(\text{Modern}) = \theta_M &- 300.9'\sin M + 12.82'\sin(2M) \\
&- 152.86'\cos(\theta_S - \varpi)\sin(\theta_{MS} - \theta_S) + \text{Variation} \\
&+ \text{Annual inequality} + \ldots
\end{aligned} \tag{27}$$

¹²We have recast the formula in terms of the longitude of the apogee, instead of the perigee as in
modern notation. This will only change the signs of some terms.
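The regrouping that leads from (26) to (27) can be verified numerically. In the sketch below (hypothetical function names), $76.43'$ is split off from the $377.33'$ equation-of-centre term and combined with the evection term through the sum-to-product relation, with $A = M$ and $B = 2(\theta_M - \theta_S) - M$. The identity is exact when the elongation is written with $\theta_M$; writing $\theta_{MS}$ in its place, as in (27), differs only at higher order.

```python
import math


def modern_two_terms(theta_m, theta_s, varpi):
    """sin M and evection terms of eq. (26); angles in radians, result in minutes."""
    M = theta_m - varpi
    D = theta_m - theta_s
    return -377.33 * math.sin(M) - 76.43 * math.sin(2 * D - M)


def combined_form(theta_m, theta_s, varpi):
    """The same two terms after regrouping, as in eq. (27)."""
    M = theta_m - varpi
    D = theta_m - theta_s
    return (-(377.33 - 76.43) * math.sin(M)
            - 2 * 76.43 * math.cos(theta_s - varpi) * math.sin(D))
```

The two functions should agree for arbitrary arguments, and the regrouped coefficients are $377.33 - 76.43 = 300.90$ and $2 \times 76.43 = 152.86$.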


We may now compare the expression for $\theta_t$ in Tantrasangraha and Yukti-bhasa, to
the first order in $r_0/R$ and $\frac{R/2}{10R}$, with the modern expression, and with the one which
follows from the geometrical model of Ibn al-Shatir (outlined in Appendix 3). The
coefficient of the $\sin M$ term in the Tantrasangraha/Yukti-bhasa formula (25) has very
nearly the same value as in the modern expression (27). Ibn al-Shatir's value for
the same is 296.05′. The evection term in (25) has the same form as in the modern
expression, though with a higher coefficient of 171.9′. In fact, the Laghumanasa
value of 143′58″ seems to be better. In Ibn al-Shatir's theory this has the value
162.35′. It should however be remembered that we have retained only the first
order terms in the Tantrasangraha/Yukti-bhasa expression, as also in Ibn al-Shatir's
theory. There is no $\sin(2M)$ term in Tantrasangraha/Yukti-bhasa. This is true even
if we include the higher order terms in $r_0/R$. However, the coefficient of this term is
small even in the modern theory.

It is possible to compare the exact values of the correction to the mean longitude
due to the equation of centre and evection, without any approximation, at syzygy
($\theta_M - \theta_S = 0^\circ$ or $180^\circ$) or quadrature ($\theta_M - \theta_S = \pm 90^\circ$), when it is also assumed
that the anomaly, $M = 90^\circ$. At syzygy, the second correction is zero in
Tantrasangraha/Yukti-bhasa, and when the anomaly is 90°, the magnitude of $\theta_t - \theta_M$ is
$\sin^{-1}\left(\frac{7}{80}\right) = 301.2'$, compared with the modern value of 300.9′, Ptolemy's value of 301′ and Ibn
al-Shatir's value of 295′. At quadrature, when the anomaly is 90°, it is easy to check
that $D_1 = 10.038R$ and $D_2 = 10.05R$. Then the magnitude of $\theta_t - \theta_M$ is
$\sin^{-1}\left(\frac{7}{80}\right) + \sin^{-1}\left(\frac{R}{2D_2}\right) = 472.3'$,
compared with the modern value of 453.8′, Ptolemy's value of 460′ and Ibn
al-Shatir's value of 456′.
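The figures 301.2′ and 472.3′ can be reproduced from the exact expressions for the manda-hypotenuse, the true distance and the two arcsine corrections. The following sketch uses hypothetical function names and makes one simplifying assumption, stated in the code as well: the angle $\theta_S - \varpi$ is approximated by $M$ minus the elongation, that is, the small difference between $\theta_{MS}$ and $\theta_M$ is neglected inside the cosine.

```python
import math

R = 3438.0            # trijya, in minutes of arc
R0 = 7.0 / 80.0 * R   # mean radius of the Moon's manda-epicycle


def manda_distance(M):
    """D1 = 10K: distance of the Moon from the centre of the bhagola."""
    return 10 * R**2 / (math.sqrt(R**2 - (R0 * math.sin(M))**2)
                        - R0 * math.cos(M))


def lunar_correction_minutes(M, elong):
    """theta_t - theta_M in minutes of arc; M and elong in radians.

    elong is theta_MS - theta_S. Assumption of this sketch:
    theta_S - varpi is taken as M - elong.
    """
    d1 = manda_distance(M)
    r = 0.5 * R * math.cos(M - elong)        # separation of the two centres
    d2 = math.hypot(d1 + r * math.cos(elong), r * math.sin(elong))
    first = math.asin(R0 * math.sin(M) / R)  # equation of centre
    second = math.asin(r * math.sin(elong) / d2)  # second correction
    return -math.degrees(first + second) * 60.0
```

With `M = math.pi / 2`, an elongation of `0.0` corresponds to the syzygy case and `math.pi / 2` to the quadrature case discussed above.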

Hence, the Tantrasangraha/Yukti-bhasa value for the maximum correction due to the
equation of centre and evection is very accurate at syzygies, and reasonable at
quadratures.

From (22) and using the value of $r_0/R = \frac{7}{80}$, we find that the manda-hypotenuse
in yojanas, $D_1$, has the values $10.9589R$, $10.0385R$ and $9.1954R$ when $M$ is 0°
(Moon at apogee), 90° and 180° (Moon at perigee), respectively.

At syzygy, $\theta_{MS} - \theta_S = 0^\circ$ or $180^\circ$. Then from (21), $D_2 = D_1 + \frac{R}{2}\cos M$, and it
can be easily seen that the true distance of the Moon from the earth, $D_2$, has the values
$11.4589R$, $10.0385R$ and $8.6954R$ when $M$ is 0° (apogee), 90° and 180° (perigee),
respectively. Hence, at syzygy, the ratio of the maximum and minimum distances
of the Moon from the earth is 1.3178 according to Tantrasangraha/Yukti-bhasa. This
ratio is 1.1925 in Ptolemy's theory, and 1.1884 in Ibn al-Shatir's theory.

At quadrature, $\theta_{MS} - \theta_S = \pm 90^\circ$. Then, $D_2 = D_1$ when $M = 0^\circ$ or $180^\circ$, and
$D_2 = \sqrt{D_1^2 + \frac{R^2}{4}}$ when $M = \pm 90^\circ$. The true lunar distance $D_2$ has the values
$10.9589R$, $10.0509R$ and $9.1954R$ when $M$ is 0° (apogee), 90° and 180° (perigee),
respectively. Hence, at quadrature, the ratio of the maximum and minimum distances
of the Moon from the earth is 1.1918 according to Tantrasangraha/Yukti-bhasa. In
Ptolemy's model this ratio turns out to be 1.3112 and in Ibn al-Shatir's model,
iL Sym 

Overall, the ratio of the maximum and minimum distances turns out to be
1.3178, 1.9278 and 1.3112 according to Tantrasangraha/Yukti-bhasa, Ptolemy's
model and Ibn al-Shatir's theory respectively.
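The Tantrasangraha/Yukti-bhasa distances and ratios quoted above follow directly from (21) and (22) with $r_0/R = 7/80$. A short sketch with hypothetical function names is given below; as an assumption of the sketch, $\theta_S - \varpi$ is again taken to be the anomaly minus the elongation. It does not attempt to reproduce the figures for Ptolemy or Ibn al-Shatir.

```python
import math

R = 3438.0            # trijya, in minutes of arc
R0 = 7.0 / 80.0 * R   # mean radius of the Moon's manda-epicycle


def manda_distance(M):
    """D1 of eq. (22), in yojanas; M in radians."""
    return 10 * R**2 / (math.sqrt(R**2 - (R0 * math.sin(M))**2)
                        - R0 * math.cos(M))


def true_distance(M, elong):
    """D2 of eq. (21), in yojanas; elong is theta_MS - theta_S in radians.

    Assumption of this sketch: theta_S - varpi is taken as M - elong.
    """
    r = 0.5 * R * math.cos(M - elong)
    return math.hypot(manda_distance(M) + r * math.cos(elong),
                      r * math.sin(elong))


def distances_in_units_of_R(elong):
    """D2/R at anomalies of 0, 90 and 180 degrees."""
    return [true_distance(math.radians(m), elong) / R for m in (0, 90, 180)]
```

`distances_in_units_of_R(0.0)` gives the syzygy distances and `distances_in_units_of_R(math.pi / 2)` the quadrature ones; the ratio of the first to the last entry in each list is the ratio of maximum to minimum distance discussed in the text.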

Thus it is remarkable that Indian astronomers, from the time of Vatesvara (c.904)
and Manjulacarya (c.932), arrived at a very accurate expression for the second lunar
correction and this seems to have been based on a fairly reasonable geometrical
picture of motion as revealed in the later Kerala works, Tantrasangraha (c.1500)
and Yukti-bhasa (c.1530).

Appendix 1
Geocentric Coordinates of a Planet in Kepler's Model


The planetary models in ancient times can be appreciated better if we understand
how the geocentric coordinates of a planet are calculated in Kepler's model, which is
correct in its essentials even today [13]. The three laws discovered by Kepler in the
early seventeenth century form the basis of our present understanding of planetary
orbits. The three laws are:


1. Each planet moves around the Sun in an ellipse, with Sun at one of the foci. 
2. The areal velocity of a planet in its orbit is a constant. 


3. The square of the time-period of a planet in its orbit is proportional to the 
cube of the semi-major axis of the ellipse in which it moves. 


Kepler's laws can be derived from Newton's second law of motion and law of 
gravitation.¹⁴ If we take the time period of a planet as an input, Kepler's third law 
does not play any role for an individual orbit. We should note that Kepler's laws are 
essentially kinematical laws, which do not make any reference to the concepts of 
'acceleration' and 'force', as we understand them. Even then, they capture the very 
essence of the nature of planetary orbits and can be used to calculate the planetary 
coordinates, once we know the parameters of the ellipse and the initial coordinates. 
The planetary models in ancient astronomy are kinematical and in that sense they 
can be compared with Kepler's model. In the following, we shall elaborate on the 
computation of the geocentric longitude and latitude of a planet in Kepler's model. 


¹⁴ Conversely, Newton's inverse square law of gravitation can also be arrived at from Kepler's laws! 
Figure 8: Elliptic orbit of a planet around the Sun. 


Elliptic Orbits and Equation of Centre 


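The elliptic orbit of a planet (P) around the Sun (S, which is also one of the foci of 
the ellipse) is represented in Figure 8. $a$ and $b$ are the semi-major and semi-minor 
axes of the ellipse. $\Gamma$ refers to the first point of Aries. $\varpi = \Gamma\hat{S}A$ is the longitude of 
the aphelion (A). $\theta_h = \Gamma\hat{S}P$ is the heliocentric longitude of the planet, and $r = SP$ 
is its distance from the Sun. 

Here $e$ is the eccentricity of the ellipse and $l = a(1 - e^2)$. 

Then the equation of the ellipse (in polar coordinates, with the origin at one of 
the foci), may be written as 

$$\frac{l}{r} = 1 - e\cos(\theta_h - \varpi). \tag{28}$$

Therefore, 

$$r = l\,[1 + e\cos(\theta_h - \varpi)] + O(e^2), \tag{29}$$

$$r^2 = l^2\,[1 + 2e\cos(\theta_h - \varpi)] + O(e^2). \tag{30}$$

Now the areal velocity of the planet at any instant is $\frac{1}{2} r^2 \dot{\theta}_h$. According to Kepler's 
second law, this is a constant. As the area of the ellipse is $\pi a b$, the areal velocity can 
also be written as $\frac{\pi a b}{T} = \frac{1}{2}\omega a b$, where $T$ is the time period and $\omega = \frac{2\pi}{T}$ is the mean 
angular velocity of the planet. Hence, 

$$r^2\,\dot{\theta}_h = \omega a b. \tag{31}$$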
Using the expression for $r^2$ in (30), we find 

$$l^2\,\dot{\theta}_h\,[1 + 2e\cos(\theta_h - \varpi)] = \omega a b + O(e^2). \tag{32}$$

Now $l = a(1 - e^2) = a + O(e^2)$ and $ab = a^2 + O(e^2)$. Hence, 

$$\dot{\theta}_h\,[1 + 2e\cos(\theta_h - \varpi)] \approx \omega, \tag{33}$$

where the equation is correct to $O(e)$. Integrating with respect to time, we obtain 

$$\theta_h + 2e\sin(\theta_h - \varpi) \approx \omega t, \tag{34}$$

or again, 

$$\theta_h - \theta_m = \theta_h - \omega t = -2e\sin(\theta_m - \varpi) + O(e^2). \tag{35}$$

Here $\theta_m = \omega t$ is the mean longitude which increases linearly with time, $t$. $\omega t - \varpi$, 
the difference between the longitudes of the mean planet and the apogee, is the 
'anomaly'. Eq. (35) gives the equation of centre which is the difference between the 
true heliocentric longitude $\theta_h$ and the mean longitude $\theta_m$, correct to $O(e)$, in terms 
of the anomaly. Clearly, the equation of centre arises due to the eccentricity of the 
orbit. 
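
As a rough numerical check of (35), the following Python sketch compares the first-order equation of centre, $-2e\sin(\theta_m - \varpi)$, with the value obtained by solving Kepler's equation numerically. The function names and the two sample eccentricities (roughly those of the Earth and Mars) are illustrative assumptions and are not taken from the text.

```python
import math

def equation_of_centre_exact(anomaly, e, iterations=50):
    """True minus mean heliocentric longitude, in radians.

    `anomaly` is the mean anomaly measured from the aphelion, as in the text.
    """
    # Kepler's equation is usually written from the perihelion: shift by 180 degrees.
    m_peri = (anomaly + math.pi) % (2 * math.pi)
    ecc_anom = m_peri
    for _ in range(iterations):  # Newton's method for E - e sin E = M
        ecc_anom -= ((ecc_anom - e * math.sin(ecc_anom) - m_peri)
                     / (1 - e * math.cos(ecc_anom)))
    true_anom = 2 * math.atan2(math.sqrt(1 + e) * math.sin(ecc_anom / 2),
                               math.sqrt(1 - e) * math.cos(ecc_anom / 2))
    diff = true_anom - m_peri
    return (diff + math.pi) % (2 * math.pi) - math.pi  # wrap into [-pi, pi)

def equation_of_centre_first_order(anomaly, e):
    """Eq. (35): -2 e sin(anomaly), anomaly measured from the aphelion."""
    return -2 * e * math.sin(anomaly)

if __name__ == "__main__":
    for e in (0.0167, 0.0934):  # assumed sample eccentricities
        worst = max(abs(equation_of_centre_exact(math.radians(d), e)
                        - equation_of_centre_first_order(math.radians(d), e))
                    for d in range(0, 360, 5))
        print(f"e = {e}: largest difference = {math.degrees(worst) * 60:.2f} arcmin")
```

The two expressions should differ only by terms of order $e^2$, which is what the printed differences are meant to illustrate.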


Geocentric Longitude of an Exterior Planet 


The orbits of all the planets are inclined at small angles to the plane of Earth's orbit 
around the Sun, known as the ecliptic. We will ignore these inclinations and assume that 
all the planetary orbits lie on the plane of the ecliptic while calculating the planetary 
longitudes, as the corrections introduced by them (inclinations) are known to be 
small. We will consider the longitude of an exterior planet like Mars, Jupiter or 
Saturn and an interior planet like Mercury or Venus, separately. 

The elliptic orbit of an exterior planet (P) and that of the Earth (E) around the 
Sun (S) are shown in Figure 9. Here, $\theta_h = \Gamma\hat{S}P$ is the true heliocentric longitude 
of the planet. $\theta_S = \Gamma\hat{E}S$ and $\theta_g = \Gamma\hat{E}P$ are the true geocentric longitudes of the 
Sun and the planet respectively, while $r$ and $R$ are the distances of the Earth and the 
planet from the Sun, which vary along their orbits. To facilitate comparison with 
the Indian models, $EP' = R$ is drawn parallel to $SP$. Then $P'P$ is parallel to $ES$ 
and $P'P = r$. We have already described how $\theta_h$ is computed using the expression 
for the equation of centre. From this, the true geocentric longitude, $\theta_g$, has to be 
computed. 

Now 

$$E\hat{P}S = P\hat{E}P' = \theta_g - \theta_h, \tag{36}$$
Figure 9: Heliocentric and geocentric longitudes of an exterior planet in Kepler's model. 


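and 

$$E\hat{S}P = 180^\circ - (\theta_S - \theta_h). \tag{37}$$

$$EP^2 = R^2 + r^2 - 2rR\cos[180^\circ - (\theta_S - \theta_h)], \tag{38}$$

or, 

$$EP = \left[(R + r\cos(\theta_S - \theta_h))^2 + r^2\sin^2(\theta_S - \theta_h)\right]^{1/2}. \tag{39}$$

Also, 

$$\frac{\sin(E\hat{P}S)}{ES} = \frac{\sin(E\hat{S}P)}{EP}. \tag{40}$$

Using (36)-(40), 

$$\sin(\theta_g - \theta_h) = \frac{r\sin(\theta_S - \theta_h)}{\left[(R + r\cos(\theta_S - \theta_h))^2 + r^2\sin^2(\theta_S - \theta_h)\right]^{1/2}}. \tag{41}$$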
Figure 10: Heliocentric and geocentric longitudes of an interior planet in 
Kepler's model. 


Here $(\theta_S - \theta_h)$, the difference between the longitudes of the Sun and the 
heliocentric planet, is the 'solar anomaly'. Thus, (41) gives $\theta_g - \theta_h$ in terms of the solar 
anomaly. Adding this to $\theta_h$, we get the true geocentric longitude, $\theta_g$, of the planet. 
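
The computation of the geocentric longitude of an exterior planet from (39) and (41) can be sketched in Python as below. The function names are ours, and the cross-check simply adds the vectors Earth-to-Sun and Sun-to-planet in the plane of the ecliptic; the sample distances (R = 1.52, r = 1, roughly Mars and the Earth in astronomical units) are assumed purely for illustration.

```python
import math

def geocentric_longitude_exterior(theta_h, theta_s, R, r):
    """Eq. (41): true geocentric longitude of an exterior planet, in radians.

    theta_h: true heliocentric longitude of the planet
    theta_s: true geocentric longitude of the Sun
    R, r:    Sun-planet and Sun-Earth distances (R > r)
    """
    anomaly = theta_s - theta_h  # the 'solar anomaly'
    ep = math.hypot(R + r * math.cos(anomaly), r * math.sin(anomaly))  # eq. (39)
    return (theta_h + math.asin(r * math.sin(anomaly) / ep)) % (2 * math.pi)

def geocentric_longitude_vector(theta_h, theta_s, R, r):
    """Cross-check: add the vectors Earth->Sun and Sun->planet directly."""
    x = r * math.cos(theta_s) + R * math.cos(theta_h)
    y = r * math.sin(theta_s) + R * math.sin(theta_h)
    return math.atan2(y, x) % (2 * math.pi)

if __name__ == "__main__":
    th, ts = math.radians(100.0), math.radians(160.0)  # assumed sample longitudes
    print(math.degrees(geocentric_longitude_exterior(th, ts, 1.52, 1.0)))
    print(math.degrees(geocentric_longitude_vector(th, ts, 1.52, 1.0)))
```

Since R > r for an exterior planet, the quantity R + r cos(θ_S − θ_h) is always positive, so the arcsine in (41) determines θ_g − θ_h without ambiguity.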


Geocentric Longitude of an Interior Planet 


The elliptic orbit of an interior planet (P) and that of the Earth (E) around the 
Sun are shown in Figure 10. Here, $\theta_h = \Gamma\hat{S}P$ is the true heliocentric longitude of 
the planet, which can be computed from the mean heliocentric longitude and the 
equation of centre. $\theta_S = \Gamma\hat{E}S$ and $\theta_g = \Gamma\hat{E}P$ are the true geocentric longitudes of 
the Sun and the planet respectively. Here $r$ and $R$ are the variable distances of the 
planet and the Earth from the Sun, respectively. 


Now 

$$S\hat{E}P = \theta_g - \theta_S. \tag{42}$$

It can be easily seen that 

$$E\hat{S}P = 180^\circ - (\theta_h - \theta_S), \tag{43}$$

$$EP = \left[(R + r\cos(\theta_h - \theta_S))^2 + r^2\sin^2(\theta_h - \theta_S)\right]^{1/2}. \tag{44}$$
Figure 11: Heliocentric and geocentric latitudes of a planet in Kepler's model. 


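Also, 

$$\frac{\sin(S\hat{E}P)}{SP} = \frac{\sin(E\hat{S}P)}{EP}. \tag{45}$$

Using (42)-(45) we get, 

$$\sin(\theta_g - \theta_S) = \frac{r\sin(\theta_h - \theta_S)}{\left[(R + r\cos(\theta_h - \theta_S))^2 + r^2\sin^2(\theta_h - \theta_S)\right]^{1/2}}. \tag{46}$$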


The difference $(\theta_g - \theta_S)$ is determined from this equation. Adding this to $\theta_S$, we 
get the true geocentric longitude, $\theta_g$, of the planet. Note that the true longitude of 
the Sun, $\theta_S$, and the true heliocentric longitude of the planet, $\theta_h$ (obtained by adding 
the equation of centre for the planet to the mean heliocentric longitude), 
should be obtained first. Then the true geocentric longitude of an interior planet can 
be obtained using (46). 
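
A short Python sketch of (44) and (46) is given below. As an illustration, it scans the angle θ_h − θ_S and checks that the largest elongation of the planet from the Sun equals arcsin(r/R), as expected when the line of sight is tangent to the planet's orbit. The function names and the sample values (R = 1, r = 0.723, roughly the Earth and Venus on circular orbits) are assumptions made only for this example.

```python
import math

def elongation_interior(theta_h, theta_s, R, r):
    """Eq. (46): theta_g - theta_S for an interior planet, in radians.

    R: Sun-Earth distance, r: Sun-planet distance (r < R).
    """
    d = theta_h - theta_s
    ep = math.hypot(R + r * math.cos(d), r * math.sin(d))  # eq. (44)
    return math.asin(r * math.sin(d) / ep)

def max_elongation(R, r, steps=3600):
    """Largest value of theta_g - theta_S found by scanning theta_h - theta_S."""
    return max(elongation_interior(math.radians(360.0 * k / steps), 0.0, R, r)
               for k in range(steps))

if __name__ == "__main__":
    R, r = 1.0, 0.723  # assumed sample distances
    print("scanned maximum (deg):", math.degrees(max_elongation(R, r)))
    print("arcsin(r/R)     (deg):", math.degrees(math.asin(r / R)))
```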


Heliocentric and Geocentric Latitudes of a Planet 
In Figure 11, the orbit of the planet P is inclined at an angle $i_h$ to the ecliptic. N 
and N' are the nodes. PP' is the circular arc perpendicular to the ecliptic. Then the 
heliocentric latitude $\beta_s$ is given by 

$$\beta_s = P\hat{S}P' = \frac{PP'}{SP}. \tag{47}$$

If $\lambda_P$ and $\lambda_N$ are the heliocentric longitudes of the planet and the node, it can be 
easily seen that 

$$\sin\beta_s = \sin i_h\,\sin(\lambda_P - \lambda_N), \tag{48}$$
Figure 12: Ptolemy's model of lunar motion. 


or, 

$$\beta_s \approx i_h\,\sin(\lambda_P - \lambda_N), \tag{49}$$

as $i_h$ and $\beta_s$ are small. Note that $\lambda_P$ stands for the true heliocentric longitude of the 
planet. We have also shown Earth's orbit in the figure. The geocentric latitude $\beta_g$ is 
given by 

$$\beta_g = P\hat{E}P' = \frac{PP'}{EP}. \tag{50}$$

From (47)-(50), we find that 

$$\beta_g = \beta_s\,\frac{SP}{EP} = i_h\,\frac{SP}{EP}\,\sin(\lambda_P - \lambda_N), \tag{51}$$

where $EP$, the true distance of the planet from the Earth, can be found from (39) or 
(44). 
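
The relations (48) and (51) can be illustrated with the following Python sketch. The function names are ours, and the sample values (an inclination of 1.85° and a node at 49.6°, roughly those of Mars, with SP = 1.52 and EP = 0.52 near opposition) are assumed only for the sake of the example.

```python
import math

def heliocentric_latitude(i_h, lam_p, lam_n):
    """Eq. (48), without the small-angle approximation (radians)."""
    return math.asin(math.sin(i_h) * math.sin(lam_p - lam_n))

def geocentric_latitude(i_h, lam_p, lam_n, sp, ep):
    """Eq. (51): small-angle geocentric latitude, scaled by SP/EP (radians)."""
    return i_h * (sp / ep) * math.sin(lam_p - lam_n)

if __name__ == "__main__":
    i_h = math.radians(1.85)      # assumed inclination
    lam_n = math.radians(49.6)    # assumed longitude of the node
    lam_p = math.radians(139.6)   # planet taken 90 degrees from the node
    print("heliocentric latitude (deg):",
          math.degrees(heliocentric_latitude(i_h, lam_p, lam_n)))
    print("geocentric latitude   (deg):",
          math.degrees(geocentric_latitude(i_h, lam_p, lam_n, 1.52, 0.52)))
```

When the planet is close to the Earth (EP smaller than SP), the geocentric latitude is magnified relative to the heliocentric latitude by the factor SP/EP.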
Appendix 2 
Ptolemy’s Lunar Theory 


Figure 12 depicts the geometrical model of Ptolemy which incorporates both the 
corrections, namely, 'equation of centre' and 'evection' [26]. Here O is the centre 
of the earth and C is the centre of the circle with radius $R$, at a distance $r_2$ from O 
in the direction OA to be specified shortly. B is the mean Moon, whose geocentric 
longitude $\theta_M = \Gamma\hat{O}B$ increases uniformly with respect to O and not C, though it 
is at a constant distance from C. The direction of OA is chosen such that $A\hat{O}B = 
2(\theta_M - \theta_S)$, where $\theta_S$ is the mean longitude of the Sun. Draw a circle of radius $r$ with 
B as the centre. This is the epicycle corresponding to the equation of centre. The 
true Moon P is located on the epicycle such that BP is in the direction of Moon's 
apogee. Then the true longitude of the Moon, $\theta_t = \Gamma\hat{O}P$. This is a combined 
'eccentric-epicycle' model. Extend OB to OQ, such that PQ is perpendicular to 
OQ. Then $P\hat{B}Q = \theta_M - \varpi = M$ is the anomaly, where $\varpi$ is the longitude of 
Moon's apogee. 

The difference between the true and mean longitudes of the Moon, $\theta_M - \theta_t = 
\Gamma\hat{O}B - \Gamma\hat{O}P = P\hat{O}B$, satisfies the relation: 

$$OP\,\sin(\theta_M - \theta_t) = PQ = r\sin M. \tag{52}$$

Now 

$$OP = \sqrt{(OB + r\cos M)^2 + r^2\sin^2 M}. \tag{53}$$

Here OB is to be determined. Draw CN perpendicular to OB. As $C\hat{O}N = 2(\theta_M - 
\theta_S)$, we have $CN = r_2\sin 2(\theta_M - \theta_S)$ and $ON = r_2\cos 2(\theta_M - \theta_S)$. Noting that 
$NB = \sqrt{CB^2 - CN^2}$, with $CB = R$, we have 

$$OB = NB + ON = \sqrt{R^2 - r_2^2\sin^2(2(\theta_M - \theta_S))} + r_2\cos(2(\theta_M - \theta_S)). \tag{54}$$

Then the correction to the mean longitude, $\theta_t - \theta_M$, is to be determined from 

$$\sin(\theta_M - \theta_t) = \frac{r\sin M}{\sqrt{(OB + r\cos M)^2 + r^2\sin^2 M}}, \tag{55}$$

where OB is given by (54). 

The above equation (55) incorporates both the 'equation of centre' and 'evection' 
terms. The expression for $\theta_t - \theta_M$ does not reduce to the modern form (27) for the 
sum of these terms, in any reasonable approximation. However at syzygy, the 
correction has the same form as the $\sin M$ term in the equation of centre with 
$OB = R + r_2$ replacing $R$. The second correction would be zero at quadrature also, 
when $M = 0^\circ$ or $180^\circ$ (at apogee or perigee of the epicycle), just as in the modern 
theory. The second correction has the maximum value when $M = \pm 90^\circ$. 

Ptolemy gives the maximum value of the total correction to be 301' at syzygy, 
compared to the modern value of 300.9'. At quadrature the maximum is given to 
be 460' compared to the modern value of 453.76'. At other values of $\theta_M - \theta_S$ and 
anomaly, Ptolemy's theory would not be impressive, as far as the correction to the 
mean longitude is concerned. 
The values of $\frac{r}{R}$ and $\frac{r_2}{R}$ can be calculated from the maximum corrections at the 
syzygy and quadrature. From (54) and (55), it can be easily checked that 

$$\text{at syzygy,} \quad \tan(\theta_M - \theta_t)\big|_{\text{maximum}} = \frac{r}{R + r_2}, \tag{56}$$

$$\text{and at quadrature,} \quad \tan(\theta_M - \theta_t)\big|_{\text{maximum}} = \frac{r}{R - r_2}. \tag{57}$$

From the given values of the maximum corrections, we find that $\frac{r}{R} = 0.1063$ and 
$\frac{r_2}{R} = 0.2106$ in Ptolemy's model. 
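
The correction given by (54) and (55) can be evaluated numerically, as in the Python sketch below. It takes R = 1 and identifies the two quoted values 0.1063 and 0.2106 with r/R (epicycle radius) and r_2/R (eccentricity) respectively; this identification and the function names are our assumptions. The sketch is meant to reproduce the maximum corrections of about 301' at syzygy and 460' at quadrature.

```python
import math

R = 1.0
r = 0.1063    # epicycle radius r/R (value quoted in the text)
r2 = 0.2106   # distance OC, r_2/R (value quoted in the text)

def ob(elongation):
    """Eq. (54); `elongation` is theta_M - theta_S in radians."""
    return (math.sqrt(R**2 - (r2 * math.sin(2 * elongation))**2)
            + r2 * math.cos(2 * elongation))

def ptolemy_correction(anomaly, elongation):
    """theta_M - theta_t in radians, from eq. (55).

    The tangent form r sin M / (OB + r cos M) is used; it gives the same angle
    as the sine form in (55) because OB + r cos M is positive.
    """
    return math.atan2(r * math.sin(anomaly), ob(elongation) + r * math.cos(anomaly))

def arcmin(angle):
    """Convert radians to minutes of arc."""
    return math.degrees(angle) * 60

if __name__ == "__main__":
    print("syzygy,     M = 90 deg:", round(arcmin(ptolemy_correction(math.pi / 2, 0.0)), 1))
    print("quadrature, M = 90 deg:", round(arcmin(ptolemy_correction(math.pi / 2, math.pi / 2)), 1))
```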


Variation in Earth-Moon Distance 


Now at syzygy, $OB = R + r_2$. Then $OP = R + r_2 + r$ when $M = 0^\circ$ (apogee of 
the epicycle), and $OP = R + r_2 - r$ when $M = 180^\circ$ (perigee of the epicycle). 
Similarly, at quadrature, $OB = R - r_2$. Then $OP = R - r_2 + r$ when $M = 0^\circ$ 
(apogee of the epicycle), and $OP = R - r_2 - r$ when $M = 180^\circ$ (perigee of the 
epicycle). Using the values of $\frac{r}{R}$ and $\frac{r_2}{R}$ above, we find that the ratio of the maximum 
and minimum distances of the Moon from earth is 1.1925 at syzygy, 1.3112 at 
quadrature, and 1.9278 overall. This is of course absurd. Hence, Ptolemy's model 
can hardly be taken to be a serious physical model. This is one of the reasons for 
the search for an alternate lunar theory in the later Islamic tradition of astronomy, 
which otherwise followed the Ptolemaic framework. 
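
The three distance ratios quoted above follow from simple arithmetic, which may be sketched in Python as follows. The function name is ours, R is taken as 1, and 0.1063 and 0.2106 are read as r/R and r_2/R, which is an assumption about how the quoted values are to be assigned.

```python
def ptolemy_distance_ratios(r, r2, R=1.0):
    """Ratios of maximum to minimum Earth-Moon distance in Ptolemy's model.

    Returns (ratio at syzygy, ratio at quadrature, overall ratio).
    """
    syzygy_max, syzygy_min = R + r2 + r, R + r2 - r   # OB = R + r2
    quad_max, quad_min = R - r2 + r, R - r2 - r       # OB = R - r2
    return (syzygy_max / syzygy_min,
            quad_max / quad_min,
            syzygy_max / quad_min)

if __name__ == "__main__":
    for label, value in zip(("syzygy", "quadrature", "overall"),
                            ptolemy_distance_ratios(0.1063, 0.2106)):
        print(f"{label}: {value:.4f}")
```

The overall ratio combines the largest distance (apogee of the epicycle at syzygy) with the smallest (perigee of the epicycle at quadrature), which is why it is so much larger than either of the other two.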


Appendix 3 
The Lunar Theory of Ibn al-Shatir 


Ibn al-Shatir (1304-1375/6 CE) developed a lunar theory in his Kitab Nihayat as-Sul 
fi Tashih al-Usul, where he dispenses with the eccentric deferent in Ptolemy's theory 
and introduces instead a second epicycle [10]. His model is illustrated in Figure 13. 
Here O is the centre of the earth. The mean Moon moves on a deferent circle of 
radius $R$. Consider the case when the mean Moon, mean Sun and Moon's apogee 
are in the same direction. Here the mean Moon is at A ($OA = R$). Draw a circle of 
radius $a$ around A, which is the epicycle corresponding to the 'first inequality'. $A_1$ 
is the apogee of the epicycle. Draw a second epicycle of radius $b$ around $A_1$. Then the 
true Moon is at B which is the perigee of the second epicycle. 

Figure 13: Ibn al-Shatir's model for the two corrections to the Moon. 

Consider the case when the mean Moon is at $P_0$, and the anomaly, $A\hat{O}P_0 = M$. 
Draw $P_0A_2 = a$ in the direction of Moon's apogee. Draw $A_2P = b$, such that 
$P_0\hat{A}_2P = 2(\theta_M - \theta_S)$, that is, the radius of the second epicycle rotates at twice the rate 
of elongation of the mean Moon from the mean Sun, eastwards (counterclockwise) 
from the direction of the apogee. Then the true Moon is located at P. Draw TQ 
through P and $P_0Q_1$ perpendicular to OA and $P_0A_2$. Let $\phi$ be the angle between 
the directions of the true Moon (OP) and the apogee (OA). Let $\theta_t$ be Moon's true 
longitude, and $\delta\theta = \theta_t - \theta_M$ be the total inequality. Then the 'total inequality' $\delta\theta$ 
can be expressed as 

$$\delta\theta = \theta_t - \theta_M = -P\hat{O}P_0 = \phi - M. \tag{58}$$

We derive the expression for the total inequality in the above geometrical model, 
and simplify it when only terms which are first order in $\frac{a}{R}$ and $\frac{b}{R}$ are retained. This 
will facilitate comparison with the modern expression for the lunar inequalities 
given by (26) and (27). Now, 


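$$\begin{aligned}
TQ &= P_0Q_1 = R\sin M, \\
TP &= A_2P\,\sin(2(\theta_M - \theta_S)) = b\sin(2(\theta_M - \theta_S)).
\end{aligned} \tag{59}$$

Hence, 

$$PQ = TQ - TP = R\sin M - b\sin(2(\theta_M - \theta_S)). \tag{60}$$

$$\begin{aligned}
A_2T &= A_2P\,\cos(2(\theta_M - \theta_S)) = b\cos(2(\theta_M - \theta_S)), \\
P_0A_2 &= a, \\
Q_1Q &= P_0T = P_0A_2 - A_2T = a - b\cos(2(\theta_M - \theta_S)), \\
OQ_1 &= R\cos M, \\
OQ &= OQ_1 + Q_1Q = R\cos M + a - b\cos(2(\theta_M - \theta_S)).
\end{aligned} \tag{61}$$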
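Hence 

$$\tan\phi = \frac{PQ}{OQ} = \frac{R\sin M - b\sin(2(\theta_M - \theta_S))}{R\cos M + a - b\cos(2(\theta_M - \theta_S))}. \tag{62}$$

Then $\delta\theta$ is found from (58), where $\phi$ is computed as above. 
To the first order in $\frac{a}{R}$ and $\frac{b}{R}$, 

$$\begin{aligned}
\tan\phi &\approx \frac{1}{\cos M}\left[\sin M - \frac{b}{R}\sin(2(\theta_M - \theta_S))\right]\left[1 - \frac{a - b\cos(2(\theta_M - \theta_S))}{R\cos M}\right] \\
&\approx \tan M - \frac{1}{R\cos^2 M}\left[a\sin M - b\sin M\cos(2(\theta_M - \theta_S)) + b\cos M\sin(2(\theta_M - \theta_S))\right] \\
&= \tan M - \frac{1}{\cos^2 M}\left[\frac{a}{R}\sin M + \frac{b}{R}\sin(2(\theta_M - \theta_S) - M)\right].
\end{aligned} \tag{63}$$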


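When the total correction is small, 

$$\begin{aligned}
\phi - M \approx \tan(\phi - M) &\approx (\tan\phi - \tan M)\cos^2 M \\
&\approx -\left[\frac{a}{R}\sin M + \frac{b}{R}\sin(2(\theta_M - \theta_S) - M)\right] \\
&= -\left[\frac{a - b}{R}\sin M + \frac{2b}{R}\sin(\theta_M - \theta_S)\cos(\theta_S - \varpi)\right],
\end{aligned} \tag{64}$$

where $\varpi$ is the longitude of Moon's apogee ($M = \theta_M - \varpi$). 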
Comparing the above with the modern expression (27) for the lunar inequalities, 
we notice the absence of the $\sin(2M)$ term in Ibn al-Shatir's theory. $\frac{a}{R}$ and $\frac{b}{R}$ are 
given to be $\frac{6;35}{60}$ and $\frac{1;25}{60}$ (that is, $\frac{79}{720}$ and $\frac{17}{720}$) in his model. Expressing the 
result in minutes, 

$$\theta_t \approx \theta_M - 296.05'\sin M - 162.35'\sin(\theta_M - \theta_S)\cos(\theta_S - \varpi). \tag{65}$$

This is to be compared with the values 300.9' and 152.86' for the coefficients of 
the $\sin M$ and the other term in the modern expression (27). 

It is possible to compare the exact values of the maximum correction to the 
mean longitude due to equation of centre and evection, without any approximation, 
at syzygy ($\theta_M - \theta_S = 0^\circ$ or $180^\circ$) or quadrature ($\theta_M - \theta_S = \pm 90^\circ$), when the 
anomaly, $M = 90^\circ$. With the given values of $a$ and $b$, the maximum correction 
($\theta_M - \theta_t = M - \phi$) has the values 295' at syzygy and 456' at quadrature, compared 
to the modern values 300.9' and 453.8' respectively. 
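
The exact relation (62) and the first-order form (64) can be compared numerically, as in the Python sketch below. It assumes a = 6;35 and b = 1;25 in units where R = 60, values commonly attributed to Ibn al-Shatir and consistent with the coefficients in (65); the function names are ours. The sketch is meant to reproduce the coefficients of about 296' and 162' and the maximum corrections of about 295' and 456'.

```python
import math

R = 60.0
A = 6 + 35 / 60   # radius a of the first epicycle (assumed: 6;35 for R = 60)
B = 1 + 25 / 60   # radius b of the second epicycle (assumed: 1;25 for R = 60)

def total_inequality_exact(anomaly, elongation):
    """delta-theta = phi - M in radians, from eqs. (58) and (62)."""
    pq = R * math.sin(anomaly) - B * math.sin(2 * elongation)       # eq. (60)
    oq = R * math.cos(anomaly) + A - B * math.cos(2 * elongation)   # eq. (61)
    diff = math.atan2(pq, oq) - anomaly
    return (diff + math.pi) % (2 * math.pi) - math.pi  # wrap into [-pi, pi)

def total_inequality_first_order(anomaly, elongation):
    """Eq. (64), first order in a/R and b/R (radians)."""
    # theta_S - varpi equals M - (theta_M - theta_S).
    return -((A - B) / R * math.sin(anomaly)
             + 2 * B / R * math.sin(elongation) * math.cos(anomaly - elongation))

def arcmin(angle):
    """Convert radians to minutes of arc."""
    return math.degrees(angle) * 60

if __name__ == "__main__":
    print("coefficient (a - b)/R:", round(arcmin((A - B) / R), 2), "arcmin")
    print("coefficient 2b/R:     ", round(arcmin(2 * B / R), 2), "arcmin")
    print("syzygy, M = 90 deg:    ", round(arcmin(total_inequality_exact(math.pi / 2, 0.0)), 1))
    print("quadrature, M = 90 deg:", round(arcmin(total_inequality_exact(math.pi / 2, math.pi / 2)), 1))
```

The coefficients printed here use the exact conversion from radians to minutes, so they may differ in the second decimal place from the figures in (65).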
Variation of Earth-Moon Distance 


From Figure 13, the distance of the Moon from earth, $D$, is given by 

$$D^2 = PQ^2 + OQ^2. \tag{66}$$

Using (61) and (62), we have 

$$D^2 = R^2 + a^2 + b^2 + 2aR\cos M - 2bR\cos(2(\theta_M - \theta_S) - M) - 2ab\cos(2(\theta_M - \theta_S)). \tag{67}$$

From this, it is easy to show that the maximum and minimum distances at syzygy 
are R + (a — b) and R — (a — b) respectively. Then, with the given values of a and b, 
the ratio of the maximum and minimum distances at syzygy is found to be 1.1884. 
Similarly, at quadrature, the maximum and minimum distances are R + (a + b) 
and R — (a + b). The ratio of the maximum and minimum distances at quadrature 
is then, 1.3077. This is also the ratio of the maximum and minimum distances, 
overall. Clearly, Ibn al-Shatir does manage to modify the Ptolemaic model so as to 
get rid of the totally unreasonable variation in the Earth-Moon distance which was a 
longstanding problem in the development of lunar theory in the Greek and Islamic 
traditions which followed the Ptolemaic framework. 
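
The distance ratios at syzygy and quadrature can be checked from (67) with the Python sketch below. As before, a = 6;35 and b = 1;25 for R = 60 are assumed values for Ibn al-Shatir's parameters, and the function names are ours.

```python
import math

R = 60.0
A = 6 + 35 / 60   # assumed radius a of the first epicycle (6;35)
B = 1 + 25 / 60   # assumed radius b of the second epicycle (1;25)

def moon_distance(anomaly, elongation):
    """Eq. (67): Earth-Moon distance D for anomaly M and elongation theta_M - theta_S."""
    d_squared = (R**2 + A**2 + B**2
                 + 2 * A * R * math.cos(anomaly)
                 - 2 * B * R * math.cos(2 * elongation - anomaly)
                 - 2 * A * B * math.cos(2 * elongation))
    return math.sqrt(d_squared)

def extreme_ratio(elongation, steps=360):
    """Ratio of the largest to the smallest distance as M runs over a full circle."""
    distances = [moon_distance(math.radians(360.0 * k / steps), elongation)
                 for k in range(steps)]
    return max(distances) / min(distances)

if __name__ == "__main__":
    print("syzygy:    ", round(extreme_ratio(0.0), 4))
    print("quadrature:", round(extreme_ratio(math.pi / 2), 4))
```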